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Preface 


The fundamental theory that underlies the physicist’s description of the material 
world is quantum mechanics — specifically Erwin Schrodinger’s 1926 formula- 
tion of the theory. This theory also brought with it an emphasis on certain fields 
of mathematical analysis, e.g., Hilbert space theory, spectral analysis, differen- 
tial equations, etc., which, in turn, encouraged the development of parts of pure 
mathematics. 

Despite the great success of quantum mechanics in explaining details of the 
structure of atoms, molecules (including the complicated molecules beloved of 
organic chemists and the pharmaceutical industry, and so essential to life) and 
macroscopic objects like transistors, it took 41 years before the most fundamental 
question of all was resolved: Why doesn’t the collection of negatively charged 
electrons and positively charged nuclei, which are the basic constituents of the 
theory, implode into a minuscule mass of amorphous matter thousands of times 
denser than the material normally seen in our world? Even today hardly any 
physics textbook discusses, or even raises this question, even though the basic 
conclusion of stability is subtle and not easily derived using the elementary 
means available to the usual physics student. There is a tendency among many 
physicists to regard this type of question as uninteresting because it is not easily 
reducible to a quantitative one. Matter is either stable or it is not; since nature tells 
us that it is so, there is no question to be answered. Nevertheless, physicists firmly 
believe that quantum mechanics is a ‘theory of everything’ at the level of atoms 
and molecules, so the question whether quantum mechanics predicts stability 
cannot be ignored. The depth of the question is further revealed when it is realized 
that a world made of bosonic particles would be unstable. It is also revealed by 
the fact that the seemingly innocuous interaction of matter and electromagnetic 
radiation at ordinary, every-day energies — quantum electrodynamics — should be 
a settled, closed subject, but it is not and it can be understood only in the context 
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of perturbation theory. Given these observations, it is clearly important to know 
that at least the quantum-mechanical part of the story is well understood. 

It is this stability question that will occupy us in this book. After four decades 
of development of this subject, during which most of the basic questions have 
gradually been answered, it seems appropriate to present a thorough review of 
the material at this time. 

Schrodinger’s equation is not simple, so it is not surprising that some inter- 
esting mathematics had to be developed to understand the various aspects of the 
stability of matter. In particular, aspects of the spectral theory of Schrodinger 
operators and some new twists on classical potential theory resulted from this 
quest. Some of these theorems, which play an important role here, have proved 
useful in other areas of mathematics. 

The book is directed towards researchers on various aspects of quantum 
mechanics, as well as towards students of mathematics and students of physics. 
We have tried to be pedagogical, recognizing that students with diverse back- 
grounds may not have all the basic facts at their finger tips. Physics students 
will come equipped with a basic course in quantum mechanics but perhaps will 
lack familiarity with modern mathematical techniques. These techniques will 
be introduced and explained as needed, and there are many mathematics texts 
which can be consulted for further information; among them is [118], which we 
will refer to often. Students of mathematics will have had a course in real anal- 
ysis and probably even some basic functional analysis, although they might still 
benefit from glancing at [118]. They will find the necessary quantum-mechanical 
background self-contained here in chapters two and three, but if they need more 
help they can refer to a huge number of elementary quantum mechanics texts, 
some of which, like [77, 22], present the subject in a way that is congenial to 
mathematicians. 

While we aim for a relaxed, leisurely style, the proofs of theorems are either 
completely rigorous or can easily be made so by the interested reader. It is our 
hope that this book, which illustrates the interplay between mathematical and 
physical ideas, will not only be useful to researchers but can also be a basis for 
a course in mathematical physics. 

To keep things within bounds, we have purposely limited ourselves to the 
subject of stability of matter in its various aspects (non-relativistic and relativis- 
tic mechanics, inclusion of magnetic fields, Chandrasekhar’s theory of stellar 
collapse and other topics). Related subjects, such as a study of Thomas—Fermi 
and Hartree-Fock theories, are left for another day. 
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CHAPTER 1 


Prologue 


1.1. Introduction 


The basic constituents of ordinary matter are electrons and atomic nuclei. These 
interact with each other with several kinds of forces — electric, magnetic and 
gravitational — the most important of which is the electric force. This force 
is attractive between oppositely charged particles and repulsive between like- 
charged particles. (The electrons have a negative electric charge —e while the 
nuclei have a positive charge +Ze, with Z = 1, 2,..., 92 in nature.) Thus, the 
strength of the attractive electrostatic interaction between electrons and nuclei 
is proportional to Ze*, which equals Za in appropriate units, where a is the 
dimensionless fine-structure constant, defined by 


e 3 1 
a = — = 7.297 352538 x 10 ° = 


ea eee (1.1.1) 
he 137.035 999 68 


and where c is the speed of light, 4 = h/2z and h is Planck’s constant. 
The basic question that has to be resolved in order to understand the existence 
of atoms and the stability of our world is: 


Why don’t the point-like electrons fall into the (nearly) point-like nuclei? 


This problem of classical mechanics was nicely summarized by Jeans in 1915 
[97]: 


“There would be a very real difficulty in supposing that the (force) law 1/r? held 
down to zero values of r. For the force between two charges at zero distance 
would be infinite; we should have charges of opposite sign continually rushing 
together and, when once together, no force would be adequate to separate 
them... Thus the matter in the universe would tend to shrink into nothing or 
to diminish indefinitely in size.” 
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A sensitive reader might object to Jeans’ conclusion on the grounds that 
the non-zero radius of nuclei would ameliorate the collapse. Such reasoning 
is beside the point, however, because the equilibrium separation of charges 
observed in nature is not the nuclear diameter (10~!* cm) but rather the atomic 
size (10-8 cm) predicted by Schrédinger’s equation. Therefore, as concerns 
the problem of understanding stability, in which equilibrium lengths are of the 
order of 10-8 cm, there is no loss in supposing that all our particles are point 
particles. 

To put it differently, why is the energy of an atom with a point-like nucleus 
not —oo? The fact that it is not is known as stability of the first kind; a more 
precise definition will be given later. The question was successfully answered 
by quantum mechanics, whose exciting development in the beginning of the 
twentieth century we will not try to relate — except to note that the basic theory 
culminated in Schrodinger’s famous equation of 1926 [156]. This equation 
explained the new, non-classical, fact that as an electron moves close to a nucleus 
its kinetic energy necessarily increases in such a way that the minimum total 
energy (kinetic plus potential) occurs at some positive separation rather than at 
zero separation. 


This was one of the most important triumphs of quantum mechanics! 


Thomson discovered the electron in 1897 [180, 148], and Rutherford [155] 
discovered the (essentially) point-like nature of the nucleus in 1911, so it took 
15 years from the discovery of the problem to its full solution. But it took almost 
three times as long, 41 years from 1926 to 1967, before the second part of the 
stability story was solved by Dyson and Lenard [44]. 

The second part of the story, known as stability of the second kind, is, even 
now, rarely told in basic quantum mechanics textbooks and university courses, 
but it is just as important. Given the stability of atoms, is it obvious that bulk 
matter with a large number N of atoms (say, N = 107%) is also stable in the 
sense that the energy and the volume occupied by 2/N atoms are twice that of NV 
atoms? Our everyday physical experience tells us that this additivity property, or 
linear law, holds but is it also necessarily a consequence of quantum mechanics? 
Without this property, the world of ordinary matter, as we know it, would not 
exist. 

Although physicists largely take this property for granted, there were a few 
that thought otherwise. Onsager [145] was perhaps the first to consider this 
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kind of question, and did so effectively for classical particles with Coulomb 
interactions but with the addition of hard cores that prevent particles from 
getting too close together. The full question (without hard cores) was addressed 
by Fisher and Ruelle in 1966 [66] and they generalized Onsager’s results to 
smeared out charges. In 1967 Dyson and Lenard [44] finally succeeded in 
showing that stability of the second kind for truly point-like quantum particles 
with Coulomb forces holds but, surprisingly, that it need not do so. That is, 
the Pauli exclusion principle, which will be discussed in Chapter 3, and which 
has no classical counterpart, was essential. Although matter would not collapse 
without it, the linear law would not be satisfied, as Dyson showed in 1967 [43]. 
Consequently, stability of the second kind does not follow from stability of the 
first kind! If the electrons and nuclei were all bosons (which are particles that 
do not satisfy the exclusion principle), the energy would not satisfy a linear 
law but rather decrease like —N7/°; we will return to this astonishing discovery 
later. 

The Dyson—Lenard proof of stability of the second kind [44] was one of 
the most difficult, up to that time, in the mathematical physics literature. A 
challenge was to find an essential simplification, and this was done by Lieb and 
Thirring in 1975 [134]. They introduced new mathematical inequalities, now 
called Lieb-Thirring (LT) inequalities (discussed in Chapter 4), which showed 
that a suitably modified version of the 1927 approximate theory of Thomas and 
Fermi [179, 62] yielded, in fact, a lower bound to the exact quantum-mechanical 
answer. Since it had already been shown, by Lieb and Simon in 1973 [129, 130], 
that this Thomas—Fermi theory possessed a linear lower bound to the energy, the 
many-body stability of the second kind immediately followed. 

The Dyson—Lenard stability result was one important ingredient in the solution 
to another, but related problem that had been raised many years earlier. Is it true 
that the ‘thermodynamic limit’ of the free energy per particle exists for an infinite 
system at fixed temperature and density? In other words, given that the energy 
per particle of some system is bounded above and below, independent of the size 
of the system, how do we know that it does not oscillate as the system’s size 
increases? The existence of a limit was resolved affirmatively by Lebowitz and 
Lieb in 1969 [103, 116], and we shall give that proof in Chapter 14. 

There were further surprises in store, however! The Dyson—Lenard result was 
not the end of the story, for it was later realized that there were other sources 
of instability that physicists had not seriously thought about. Two, in fact. The 
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eventual solution of these two problems leads to the conclusion that, ultimately, 
stability requires more than the Pauli principle. It also requires an upper bound 
on both the physical constants w and Za.! 

One of the two new questions considered was this. What effect does Einstein’s 
relativistic kinematics have? In this theory the Newtonian kinetic energy of an 
electron with mass m and momentum p, p*/2m, is replaced by the much weaker 
J p2c2 + m2c4 — mc*. So much weaker, in fact, that the simple atom is stable 
only if the relevant coupling parameter Za is not too large! This fact was known 
in one form or another for many years — from the introduction of Dirac’s 1928 
relativistic quantum mechanics [39], in fact. It was far from obvious, therefore, 
that many-body stability would continue to hold even if Za is kept small (but 
fixed, independent of NV). Not only was the linear N-dependence in doubt but 
also stability of the first kind was unclear. This was resolved by Conlon in 1984 
[32], who showed that stability of the second kind holds if a < 10-2 and 
Z=1. 

Clearly, Conlon’s result needed improvement and this led to the invention of 
interesting new inequalities to simplify and improve his result. We now know 
that stability of the second kind holds if and only if both a and Za are not too 
large. The bound on q@ itself was the new reality, previously unknown in the 
physics literature. 

Again new inequalities were needed when it was realized that magnetic fields 
could also cause instabilities, even for just one atom, if Za? is too large. The 
understanding of this strange, and totally unforeseen, fact requires the knowl- 
edge that the appropriate Schrodinger equation has ‘zero-modes’, as discovered 
by Loss and Yau in 1986 [139] (that is, square integrable, time-independent 
solutions with zero kinetic energy). But stability of the second kind was still 
open until Fefferman showed in 1995 [57, 58] that stability of the second kind 
holds if Z = 1 and @ is very small. This result was subsequently improved to 
robust values of Za? and a by Lieb, Loss and Solovej in 1995 [123]. 

The surprises, in summary, were that stability of the second kind requires 
bounds on the fine-structure constant and the nuclear charges. In the relativistic 
case, smallness of a and of Za@ is necessary, whereas in the non-relativistic case 
with magnetic fields, smallness of a and of Za is required. 


' If Z > 1, which it always is in nature, a bound on Za implies a bound on a, of course. The 
point here is that the necessary bound on a@ is independent of Z, even if Z is arbitrarily small. 
In this book we shall not restrict our attention to integer Z. 
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Given these facts, one can ask if the simultaneous introduction of relativistic 
mechanics, magnetic fields, and the quantization of those fields in the man- 
ner proposed by M. Planck in 1900 [149], leads to new surprises about the 
requirements for stability. The answer, proved by Lieb, Loss, Siedentop and 
Solovej [127, 119], is that in at least one version of the problem no new con- 
ditions are needed, except for expected adjustments of the allowed bounds for 
Za and a. 

While we will visit all these topics in this book, we will not necessarily follow 
the historical route. In particular, we will solve the non-relativistic problem 
by using the improved inequalities invented to handle the relativistic problem, 
without the introduction of Thomas—Fermi theory. The Thomas—Fermi story 
is interesting, but no longer essential for our understanding of the stability of 
matter. Hence we will mention it, and sketch its application in the stability of 
matter problem, but we will not treat it thoroughly, and will not make further 
use of it. Some earlier pedagogical reviews are in [108, 115]. 


1.2 Brief Outline of the Book 


An elementary introduction to quantum mechanics is given in Chapter 2. It is a 
thumbnail sketch of the relevant parts of the subject for readers who might want 
to refresh their memory, and it also serves to fix notation. Readers familiar with 
the subject can safely skip the chapter. 

Chapter 3 discusses the many-body aspects of quantum mechanics and, in 
particular, introduces the concept of stability of matter in Section 3.2. The 
chapter also contains several results that will be used repeatedly in the chapters 
to follow, like the monotonicity of the ground state energy in the nuclear charges, 
and the fact the bosons have the lowest possible ground state energy among all 
symmetry classes. 

A detailed discussion of Lieb-Thirring inequalities is the subject of 
Chapter 4. These inequalities play a crucial role in our understanding of stability 
of matter. They concern bounds on the moments of the negative eigenvalues of 
Schrodinger type operators, which lead to lower bounds on the kinetic energy of 
many-particle systems in terms of the corresponding semiclassical expressions. 
This chapter, like Chapters 5 and 6, is purely mathematical and contains analytic 
inequalities that will be applied in the following chapters. 

Electrostatics is an old subject whose mathematical underpinning goes back 
to Newton’s discussion in the Principia [144] of the gravitational force, which 
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behaves in a similar way except for a change of sign from repulsive to attrac- 
tive. Nevertheless, new inequalities are essential for understanding many-body 
systems, and these are given in Chapters 5 and 6. The latter chapter contains a 
proof of the Lieb—Oxford inequality [125], which gives a bound on the indirect 
part of the Coulomb electrostatic energy of a quantum system. 

Chapter 7 contains a proof of stability of matter of non-relativistic fermionic 
particles. This is the same model for which stability was first shown by Dyson 
and Lenard [44] in 1967. The three proofs given here are different and very 
short given the inequalities derived in Chapters 4—6. As a consequence, matter 
is not only stable but also extensive, in the sense that the volume occupied is 
proportional to the number of particles. The instability of the same model for 
bosons will also be discussed. 

The analogous model with relativistic kinematics is discussed in Chapter 8, 
and stability for fermions is proved for a certain range of the parameters a 
and Za. Unlike in the non-relativistic case, where the range of values of these 
parameters was unconstrained, bounds on these parameters are essential, as 
will be shown. The proof of stability in the relativistic case will be an important 
ingredient concerning stability of the models discussed in Chapters 9, 10 and 11. 

The influence of spin and magnetic fields will be studied in Chapter 9. If 
the kinetic energy of the particles is described by the Pauli operator, it becomes 
necessary to include the magnetic field energy for stability. Again, bounds on 
various parameters become necessary, this time a and Za’. It turns out that zero 
modes of the Pauli operator are a key ingredient in understanding the boundary 
between stability and instability. 

If the kinetic energy of relativistic particles is described by the Dirac operator, 
the question of stability becomes even more subtle. This is the content of Chap- 
ter 10. For the Brown—Ravenhall model, where the physically allowed states are 
the positive energy states of the free Dirac operator, there is always instability 
in the presence of magnetic fields. Stability can be restored by appropriately 
modifying the model and choosing as the physically allowed states the ones that 
have a positive energy for the Dirac operator with the magnetic field. 

The effects of the quantum nature of the electromagnetic field will be inves- 
tigated in Chapter 11. The models considered are the same as in Chapters 9 
and 10, but now the electromagnetic field will be quantized. These models are 
caricatures of quantum electrodynamics. The chapter includes a self-contained 
mini-course on the electromagnetic field and its quantization. The stability and 
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instability results are essentially the same as for the non-quantized field, except 
for different bounds on the parameter regime for stability. 

How many electrons can an atom or molecule bind? This question will be 
addressed in Chapter 12. The reason for including it in a book on stability of 
matter is to show that for a lower bound on the ground state energy only the 
minimum of the number of nuclei and the number of electrons is relevant. A 
large excess charge can not lower the energy. 

Once a system becomes large enough so that the gravitational interaction 
can not be ignored, stability fails. This can be seen in nature in terms of the 
gravitational collapse of stars and the resulting supernovae, or as the upper mass 
limit of cold stars. Simple models of this gravitational collapse, as appropriate 
for white dwarfs and neutron stars, will be studied in Chapter 13. In particular, 
it will be shown how the critical number of particles for collapse depends on 
the gravitational constant G, namely G~*/* for fermions and G~! for bosons, 
respectively. 

The first 13 chapters deal essentially with the problem of showing that the 
lowest energy of matter is bounded below by a constant times the number of 
particles. The final Chapter 14 deals with the question of showing that the 
energy is really proportional to the number of particles, i.e., that the energy per 
particle has a limit as the particle number goes to infinity. Such a limit exists 
not only for the ground state energy, but also for excited states in the sense that 
at positive temperature the thermodynamic limit of the free energy per particle 
exists. 


CHAPTER 2 


Introduction to Elementary 
Quantum Mechanics and Stability 
of the First Kind 


In this second chapter we will review the basic mathematical and physical 
facts about quantum mechanics and establish physical units and notation. Those 
readers already familiar with the subject can safely jump to the next chapter. 

An attempt has been made to make the presentation in this chapter as elemen- 
tary as possible, and yet present the basic facts that will be needed later. There 
are many beautiful and important topics which will not be touched upon such as 
self-adjointness of Schrodinger operators, the general mathematical structure of 
quantum mechanics and the like. These topics are well described in other works, 
e.g., [150]. 

Much of the following can be done in a Euclidean space of arbitrary dimension, 
but in this chapter the dimension of the Euclidean space is taken to be three — 
which is the physical case — unless otherwise stated. We do this to avoid confusion 
and, occasionally, complications that arise in the computation of mathematical 
constants. The interested reader can easily generalize what is done here to the 
R?, d > 3 case. Likewise, in the next chapters we mostly consider N particles, 
with spatial coordinates in R°, so that the total spatial dimension is 3N. 


2.1 A Brief Review of the Connection Between Classical and 
Quantum Mechanics 


Considering the range of validity of quantum mechanics, it is not surprising 
that its formulation is more complicated and abstract than classical mechanics. 
Nevertheless, classical mechanics is a basic ingredient for quantum mechanics. 
One still talks about position, momentum and energy which are notions from 
Newtonian mechanics. 

The connection between these two theories becomes apparent in the semi- 
classical limit, akin to passing from wave optics to geometrical optics. In its 
Hamiltonian formulation, classical mechanics can be viewed as a problem 
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of geometrical optics. This led Schrodinger to guess the corresponding wave 
equation. We refrain from fully explaining the semiclassical limit of quantum 
mechanics. For one aspect of this problem, however, the reader is referred to 
Chapter 4, Section 4.1.1. 

We turn now to classical dynamics itself, in which a point particle is fully 
described by giving its position x = (x!, x”, x*) in R° and its velocity v = 
dx /dt = x in R? at any time r, where the dot denotes the derivative with respect 
to time.! Newton’s law of motion says that along any mechanical trajectory its 
acceleration v = X satisfies 


mx = F(x,x,t), (2.1.1) 


where F is the force acting on the particle and m is the mass. With F(x, x, ft) 
given, the expression (2.1.1) is a system of second order differential equations 
which together with the initial conditions x(to) and v(to) = x(to) determine x(t) 
and thus v(t) for all times. If there are N particles interacting with each other, 
then (2.1.1) takes the form 


m;xX; = F;, i=l,...,N, (2.1.2) 


where F;; denotes the sum of all forces acting on the i™ particle and x; denotes 
the position of the i particle. As an example, consider the force between two 
charged particles, whose respective charges are denoted by Q; and Qo, namely 
the Coulomb force given (in appropriate units, see Section 2.1.7) by 


XxX; —X2 


Fi; = QiQ2 —F». (2.1.3) 


|x, — x2|9 
If Q;Q> is positive the force is repulsive and if QQ» is negative the force 


is attractive. Formula (2.1.3) can be written in terms of the potential energy 
function 


QO; Q2 


V(x1,X2) = Pe, 
1—X2 


(2.1.4) 


noting that 
F,=-V;,V and Fy, =—V,,V. (2.1.5) 
As usual, we denote the gradient by V = (0/dx!, 0/0x7, 0/dx°). 


' We follow the physicists’ convention in which vectors are denoted by boldface letters. 
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2.1.1 Hamiltonian Formulation 


Hamilton’s formulation of classical mechanics is the entry to quantum physics. 
Hamilton’s equations are 


t=—, p=-— (2.1.6) 


where H(x, p) is the Hamilton function and p the canonical momentum of 
the particle. Assuming that 


F(x) = —VV(x) @A1.7) 


for some potential V then, in the case that the canonical momentum is given by 


p=mv, (2.1.8) 
Eq. (2.1.6) with 
Pr 
H=——+4V(x) (2.1.9) 
2m 
yields (2.1.1). The function 
2 
P 
T(p) = — (2.1.10) 
2m 


is called the kinetic energy function. A simple computation using Eq. (2.1.6) 
shows that along each mechanical trajectory the function H(x(t), p(t)) is a 
constant which we call the energy, F. 


2.1.2. Magnetic Fields 


Not in all cases is the canonical momentum given by (2.1.8). An example is the 
motion of a charged particle of mass m and charge —e in a magnetic field B(x) 
in addition to a potential, V(x). The Lorentz force on such a particle located at 
x and having velocity v is” 


e 
Frorentz = ees (2.1.11) 


? We use the symbol / for the vector product on R°, instead of x, since the latter may be confused 
with x. 
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The Hamilton function is then given by* 


1 2 
H(x, p) = = (p Es ~A(x)) + V(x), (2.1.12) 


where A(x), called the vector potential, determines the magnetic field (more 
properly called the magnetic induction) B(x) by the equation 


V A A(x) = curl A(x) = B(x). (2.1.13) 


The fact that an arbitrary magnetic field can be written this way as a curl is 
a consequence of the fact that Maxwell’s equations dictate that all physical 
magnetic fields satisfy 


V - B(x) = div B(x) = 0. (2.1.14) 


The parameter c in (2.1.12) is the speed of light, which equals 299792 458 
meters/sec. 
The canonical momentum p is now not equal to mass times velocity but rather 


mv = p+ A(x). (2.1.15) 


It is a simple calculation to derive the Lorentz law for the motion of an electron in 
an external magnetic field using (2.1.6) with (2.1.12) as the Hamilton function. 

The energy associated with this B field (i.e., the amount of work needed to 
construct this field or, equivalently, the amount of money we have to pay to the 
electric power company) is* 


Emag(B) = = | 1Benrax. (2.1.16) 
R3 


The units we use are the conventional absolute electrostatic units. For further 
discussion of units see Section 2.1.7. 


3 We note that we use the convention that the electron charge equals —e, with e > 0, and hence 
the proper form of the kinetic energy is given by (2.1.12). In the formula (p — eA(x)/c)?/(2m), 
which is usually found in textbooks, e denotes a generic charge, which can be positive or 
negative. 

4 The equation a := b (or b =: a) means that a is defined by b. 
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Since the only requirement on A is that it satisfy (2.1.13) we have a certain 
amount of freedom in choosing A. It would appear that the A has three degrees 
of freedom (the three components of the vector A) but in reality there are only 
two since B has only two degrees of freedom (because div B = 0). If we assume 
that Emag(B) < oo (which should be good enough for physical applications) then 
we can choose the field A such that (2.1.13) holds and 


divA =O and [iAcoiar < Od. (2.1.17) 
R3 


A proof of this fact is given in Lemma 10.1 in Chapter 10. The condition 
div A = 0 is of no importance to us until we get to Chapter 10. All results prior 
to Chapter 10 hold irrespective of this condition. Its relevance is explained in 
Section 10.1.1 on gauge invariance. 


2.1.3. Relativistic Mechanics 


It is straightforward to describe relativistic mechanics in the Hamiltonian for- 
malism. The Hamilton function for a free relativistic particle is 


Tre(P) := cy p? + m2c? — me’, (2.1.18) 


from which the relation between p and the velocity, v, is found to be 


j= OTrei(p) _ cp 
Op J pe + mc2 
Note that |v| < c. We can, of course, include a magnetic field in this relativistic 
formalism simply by replacing p by p + £A(x). 
A potential can be added to this T,.,() so that the Hamilton function becomes 


Are(p, X) = Trei(p) + V(x). (2.1.20) 


(2.1.19) 


Hamilton’s equations (2.1.6) then yield a mathematically acceptable theory, but 
it has to be admitted that it is not truly a relativistic theory from the physical 
point of view. The reason is that the theory obtained this way is not invariant 
under Lorentz transformations, i.e., the equations of motion (and not merely 
the solutions of the equations) are different in different inertial systems. We 
shall not attempt to explain this further, because we shall not be concerned 
with true relativistic invariance in this book. In any case, ‘energy’ itself is not 
a relativistically invariant quantity (it is only a component of a 4-vector). We 
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shall, however, be concerned with the kind of mechanics defined by the Hamilton 
function Hye in (2.1.20) because this dynamics is an interesting approximation 
to a truly relativistic mechanics. 


2.1.4 Many-Body Systems 


There is no difficulty in describing many-body systems in the Hamiltonian 
formalism — with either relativistic or non-relativistic kinematics. As an example, 
consider the problem of N electrons and M static nuclei interacting with each 
other via the Coulomb force. The electrons have charge —e, and are located 


at positions X = (x1,...,Xy), X; € R? fori =1,...,N. The M nuclei have 
charges eZ = e(Z;,..., Zy) and are located at R = (Ri,..., Ry) with R; € 
R? for i=1,...,M. Then the potential energy function of this system is 
e*Vc(X, R), with 
Vco(X, R) = W(X, R)+1(X) + U(R), (2.1.21) 
where 
N M Zz. 
W(X, R) = — —— 2.1.22 
(X 2 Lie Ri (2.1.22) 
1 
I(X) = or 2.1.23 
(X) 2d; =a (2.1.23) 
1<i<j<N : 
ZZ; 
U(R) = el 2.1.24 
(R) 5, RR (2.1.24) 


The three terms have the following meaning: W(X, R) is the electron—nucleus 
attractive Coulomb interaction, /(X) is the electron—electron repulsive inter- 
action and U(R) is the nucleus—nucleus repulsive interaction. The total force 
acting on the i" electron is thus given by 


F, = —€°Vx,Vc(X, R). (2.1.25) 
The Hamilton function is the sum of kinetic energy and potential energy 


H(X, P) = T(P) + e’Vc(X, R), (2.1.26) 
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where 


N 


2 
p ‘i 
TS) 2) (P= Oiycces 2127 
(P) dan P= (p,,---. Py) (2.1.27) 
in the non-relativistic case (or with the obvious change in the relativistic case). If 
the nuclei are dynamic, one also has to add the nuclear kinetic energy, of course. 

In the case of static nuclei, R are simply fixed parameters. We point out 
that when we study stability of the quantum analogue of this system, it will be 


essential to look for bounds that are independent of R. 


2.1.5 Introduction to Quantum Mechanics 


On atomic length scales, position and momentum can no longer describe the state 
of a particle. They both play an important role as observables but to describe 
the state of a quantum mechanical particle one requires a complex valued 
function yy : R? — C, called the wave function. In the remainder of the present 
chapter we limit the discussion to a single particle. The discussion of N-particle 
wave functions, y : R°“ —> C is deferred to Chapter 3. 

In order to fix the state of an electron one has to specify infinitely many 
numbers (i.e., a whole function) — not just the six numbers p and x of classical 
mechanics. The function x +> |y(x)|? is interpreted as a probability density and 
hence we require the normalization condition 


i |w(x)|?dx = 1. (2.1.28) 
R3 


The classical energy is replaced by an energy functional, E(y), of the wave 
function of the system: 


E(w) = Ty + Vy, (2.1.29) 


where 


h2 
= =. f wyieyPas, (2.1.30) 
R3 
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and 


Vy = f vepiwenrer. (2.1.31) 


R3 


The functional Ty is called the expectation value of the kinetic energy or, in 
short, the kinetic energy of wy. Similarly, Vy is called the expectation value 
of the potential energy or, in short, the potential energy of i. The constant h 
(pronounced h-bar) is 


h 
h= aa 1.055 x 10-” grams cm? sec”, 
1 


2 1 


where h = 6.626 x 10-7” grams cm? sec™! is a constant of nature called 
Planck’s constant. 

A comparison of (2.1.30) and (2.1.10) shows that the transition from classical 
to quantum mechanics is accomplished by replacing the classical momentum p 
by the operator —ihV. We shall frequently denote —ihV by p. In this notation 


we have 


= 1 2 
Ty = =~ | KpwyeyPae. 
R3 


Note that for a complex vector v such as pw(x), the quantity |v|* denotes the 
sum of the squares of the absolute values of the components. 

Associated with the kinetic energy (2.1.30) is the operator, called the free 
Hamilton operator, or simply the free Hamiltonian for brevity, 


3 
(2.1.32) 
and associated with € is the Hamiltonian 

H=H+V 


which acts on functions yw by 


2 


h 
(Hy )(x) = 5 OP) + V(x)y(x). (2.1.33) 


16 Introduction to Quantum Mechanics 


The derivatives in (2.1.33) can be taken to be in the distributional sense. For 
nice functions y 


EW) =(y, Hy), (2.1.34) 


where the inner product of f and g is defined by 


(fa) f Farear. (2.1.35) 
R3 


(The notation f denotes the complex conjugate, sometimes also written as f*.) 
Again, for nice functions w and @¢, 


(vo) = | TorHoyxyar 
R3 


— = 
7 / (Fe) -(vo2) + V(xyP6(x)) dx. (2.1.36) 
R3 


One of the technical problems in quantum mechanics is that the left side of 
(2.1.36) does not always make sense for arbitrary ¢’s. For the purpose of this 
book we shall always interpret (Ww, H@) as the right side of (2.1.36), which is 
well defined if w and ¢ are in H!(R°).° Note that Ty, is always positive and is 
always well defined for yw € H '(R3). See [164]. 

Returning to the Coulomb law in (2.1.3), we see that the energy function for 
a hydrogenic atom is given by 


h Ze? 
Ew) = i (Fivvw? = =< 1o?) dx (2.1.37) 


e 
|x | 
R3 


The functions that appear in quantum mechanics are not always differentiable in the classical 

sense. To define the following concepts rigorously it is important to know what a distributional 
derivative is. These matters are discussed fully elsewhere, see, e.g., [118]. Nevertheless, the 
reader who is willing to accept things on faith can just assume that all derivatives are classical 
and will still be able to follow the presentation. 
The most important spaces relevant for this book are L*(IR“) consisting of functions f(x) such 
that || f(]3 = Spa | f(x)/?dx < oo, and H'(R¢), which consists of functions that are square 
integrable and whose distributional derivatives are also square integrable functions. Again, we 
refer to [118] for further details. 
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and the corresponding Hamiltonian is 


PSe Kae, (2.1.38) 


We refer to this as a hydrogenic atom because it is the Hamiltonian for a one- 
electron atom; hydrogen corresponds to the case Z = 1. It will be discussed 
further in Section 2.2.2. 

A convenient way of rewriting the kinetic energy of a function y € L?(R“) 
(for any d > 1) is via Fourier transforms. Recall that the Fourier transform of 
a function (x) is, formally, defined as’ 


Wk) = / W(xje 27 kay, (2.1.39) 
Rd 


(We say ‘formally’ since the integral is absolutely convergent only if y € 
L'(R¢). Nevertheless, yw is well defined for all yw € L7(R®). See [118] for 
details.) Then y is given in terms of the inverse Fourier transform as 


W(x) = / wk" "dk. 
Rd 


Recall also Plancherel’s identity, 


[ iweorax = | iPayPar, (2.1.40) 
R¢ R¢ 


The (non-relativistic) kinetic energy of a function y can then be expressed as 


[vues = [one?iFaprar. (2.1.41) 
Rd Rd 


So any yw € L?(R%) is also in H'(R%) if and only if the right side of (2.1.41) 
is finite. Putting it differently, the operator p acts as multiplication by 27k in 
Fourier space, and y € L?(R%) is in H!(R®) if and only if |k|y(k) is in L7(R2). 


7 A different convention for the Fourier transform that is often used is vk) = 
Qry 4? faa (ede * de. 
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Using Fourier space, it is straightforward to define the relativistic kinetic 
energy (2.1.18) in the quantum case. Namely, 


(Wy, cp? + m2e2 W) = / cy (2k)? + m2c2|h(k)|2dk. (2.1.42) 
R3 


The space of L*(R“) functions for which the right side of (2.1.42) is finite is 
called H'/?(R¢), for obvious reasons. Again, see [118] for further details. 

Magnetic fields can be introduced in quantum mechanics in the same way 
as in classical mechanics. The Hamilton function (2.1.12) of one particle in a 
magnetic field becomes 


a= (p+ £Aw) +V@) (2.1.43) 
2m Cc = 


in the quantum case, with p = —ihV, as before. One can also consider magnetic 
fields together with relativity, in which case the kinetic energy becomes 


el (p + Ae), hit, (2.1.44) 
Cc 


The square root of a positive operator can be defined via the spectral theorem. 
(See, e.g., [150].) Alternatively, it can also be defined in terms of the Green’s 
function, or resolvent, [(p + (e/c)A(x))? + t]~! for t > 0. In fact, the formula 


1 ra x dt 
ve=— f 
0 


valid for x > 0, can be used to define the square root of any non-negative 


operator, such as (p + £A(x))” +m?c?. 


2.1.6 Spin 


Elementary particles have an internal degree of freedom called spin which 
is characterized by a specific number that can take one of the values S = 
0, 1/2,1,3/2,.... (S = 0 is usually called ‘spinless’.) A particle with spin 
S carries with it an internal Hilbert space of dimension 2S + 1, that is, its 
wave function is an element of L?(IR*) @ C75+! =: L?(R?; C*5+!), For example, 
electrons are spin 1/2 particles and what this means is that the wave function is 
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really a pair of ordinary complex-valued functions 


w(x,o), o =1,2. (2.1.45) 
Another way of writing this is a two-component vector called a spinor 
o) 
x)= . 2.1.46 
w(x) ( co (2.1.46) 
The normalization condition is now given by 
[ove wands =1, 2.1.47) 
R3 
where 
(vy, 6) = Vidi + Wodo (2.1.48) 


is the inner product on C?. 

In the absence of magnetic fields, the kinetic energy acts separately on each 
of the two components of y in a manner similar to the normalization condition, 
1.€., 


Ty = Ty, + Typ: (2.1.49) 


The second formulation (2.1.46) is convenient for discussing spin 1/2 in terms 
of the three Pauli matrices 0 = (o!, 0”, 03), with® 


1_(9 1 2_ {90 —-i 3_ (1 0 
Ge ale ee le PE a CLD) 


The following relations are easily verified: 

olok =-o*o/, for j #k (2.1.51) 
and 
igk =jo', (2.1.52) 


ox 


where the indices (j,k, /) are any cyclic permutation of the numbers (1, 2, 3). 
Given a spinor we can form the three dimensional vector 


(Wow) =(bo'W), (Wo?) (Wo? )). @153) 


8 The reader should not confuse the three Pauli matrices o with the integers o labeling the spin 
components. 
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In terms of the Pauli matrices, the angular momentum operators associated with 
the electron spin are S = (h/2)o. This is discussed in every standard quantum 
mechanics textbook and is not important for us here. 

In the presence of a magnetic field, the kinetic energy of an electron has to be 
modified as” 


1 e 2 
al (e+ ~AG)) v| dx +10 f ho - B(x)yy)dx. (2.1.54) 
R3 R3 
The constant 19 is called the Bohr magneton and is given by 


he 21 1 
Lo = ae = 9.274 x 10 “erg gauss ~~. (2.1.55) 


The second term in (2.1.54) is called the Zeeman energy. Using the electron spin 
operators, it can alternatively by written as the expectation value of (e/mc)S - 
B(x). It is very important in our daily lives since it is responsible for the 
magnetization of a piece of iron. 

The reader might wonder for which y and A the expression (2.1.54) is well 
defined. If we assume that each component of the vector potential A is in L? (R?) 
then the first term in (2.1.54) is defined for all those functions y € L7(R*) with 
(d,; +i(e/he)A/)w € L?(R°) for j = 1, 2,3. They forma function space which 
is denoted by H ACR?). For further properties of this space see [118]. 

There is no difficulty in extending the above definition to the spin case. For any 
spinor wy with (0,; + i(e/hc)A/)w € L?(R?;C’) for j = 1, 2, 3, the following 
expression makes sense and serves as a definition of the kinetic energy in 
(2.1.54): 


= / lo (p " ~A(x)) w(x) de. (2.1.56) 
EY 


If the vector potential A is sufficiently smooth, an integration by parts and the 
use of the commutation relations (2.1.51) and (2.1.52) shows that the above 
expression can be rewritten as (2.1.54). Thus the Zeeman-term is hidden in 
(2.1.56). The advantage of (2.1.56) is that no smoothness assumption on A has 
to be made. Moreover, the positivity of the kinetic energy is apparent from the 
formulation (2.1.56). 


° For convenience, we use the same absolute value symbol for the length of a vector in C? or 
C? @ C? as we used earlier for the length of a vector in C3. 
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The Hamiltonian associated with (2.1.54) and (2.1.56) is given by 
2 


= (p + © a(x)) + woo - B(x) = > [> (p + ~A(x)) (2.1.57) 


2m 


This operator is called the Pauli operator’? and will be discussed in detail in 
Chapter 9. 

An important concept is gauge invariance. Physical quantities, like the 
ground state energy, depend on A only through B. This can be seen as fol- 
lows. If there are two vector potentials A, and A», with curl A; = curl Az = B, 
then A; = A> + Vx for some function x. This follows from the fact that a curl- 
free vector field is necessarily a gradient field.'' We emphasize that this is true 
only on the whole of R? and not on punctured domains, such as the exterior of an 
infinitely extended cylinder. Gauge invariance of the Hamiltonian H,4 means that 


Hayy = U()HaU(y)", 


where U(x) is the unitary multiplication operator e’/"9*™), and U? denotes 
the adjoint of U. 


2.1.7 Units 


Anyone who has studied electromagnetism knows that the problem of choosing 
suitable units can be a nightmare. This is even more so when we want to include 
h and simultaneously make the system convenient for quantum mechanics. 
One solution, favored by many physics texts, is to include all physical units 
(m, e,h, c) in all equations. While this is certainly clear it is cumbersome and 
somewhat obscures the main features of the equations. 

Except for the gravitational constant G, which will be discussed in Chap- 
ter 13, there are four (dimensional) physical constants that play a role in this 
book. These are 


¢ m =mass of the electron = 9.11 x 10-78 grams 


e e =(—1)x charge of the electron = 4.803 x 107!° grams!/? cm?/? sec™! 


A closely related operator that is often used is the simpler Pauli-Fierz operator [64]. It is the 
same as (2.1.57) except that the term (e? / 2mc?)A(x) is omitted. This operator is not gauge 
invariant (see Chapter 10) although it is useful when the A field is small. However, the absence 
of the A” term can, if taken literally, lead to instabilities. 

A formula for x is given by the line integral x(x) = [p(A 1 — A): ds. 
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e f = Planck’s constant divided by 27 = 1.055 x 10-7’ grams cm? sec™ 


¢ c =speed of light = 3.00 x 10!° cm sec™!. 


(Note that # has the dimension of energy times time.) These are the conventional 
cgs (centimeter, gram, second) electrostatic units. In particular, the energy 
needed to push two electrons from infinite separation to a distance of r 
centimeters is e7/r. 

In our choice of units for this book we were guided by the idea, which really 
originates in relativistic quantum mechanics, that the electron’s charge is the 
quantity that governs the coupling between electromagnetism and dynamics 
(classical or quantum, relativistic or non-relativistic). When e = 0 all the par- 
ticles in the universe are free and independent, so one wants to highlight the 
dependence of all physical quantities on e. To emphasize the role of e, we intro- 
duce the only dimensionless number that can be made from our four constants — 
the fine-structure constant: 


a := e?/he = 1/137.04 = 7.297 x 107°. 


We should think of / and c as fixed and aw as measuring the strength of the 
interaction, namely the electron charge squared. Our intention is to expose the 
role of e clearly and therefore we avoid using units of length, etc. that involve e 
in their definition. 

Next, we need units for length, energy and time, and the only ones we can 
form that do not involve e are, respectively: 


h 1 
hc = — = — x Compton wavelength of the electron = 3.86 x 10-'' cm 


mc 2m 
(2.1.58) 
as the unit of length, 


mc” = rest mass energy of the electron = 8.2 x 10~’ ergs (2.1.59) 


as the unit of energy, and 


Ac h 19 
—=-35= 1.29 x 10°” sec (2.1.60) 
c mc 
as the unit of time. In other words, we shall seth = m = c = 1, andthuse = /a. 
These will be the units used throughout the book. 

Thus, our quantum mechanical wave function w(x) equals i aya y) with 
x being given in terms of the dimensionless y by x = Acy. The subsequent 
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expressions will be written in terms of y (which will henceforth be called y, 
and the dimensionless argument y will henceforth be called x). 

In non-relativistic quantum mechanics (without magnetic fields) the speed of 
light does not appear and then the ‘natural’ units of length and energy are the 
Bohr radius and the Rydberg: 


Aponr = h?/me? = Ac/a = 5.29 x 10°? cm (2.1.61) 
Rydb A nO eT er (2.1.62) 
ere = — TS = = 24. x Z I 
y g We gine a mc 


These units involve e, however. While they are ‘natural’ for chemistry and atomic 
physics, they are not really the most convenient for dealing with relativistic 
dynamics and with the interaction of particles with the electromagnetic field. 
Thus, our unit of length, the electron Compton wavelength, is about 0.007 Bohr 
radii and our unit of energy is about 19000 Rydbergs. These units have the 
advantage that e appears only in a and nowhere else. 

Turning to units of the magnetic field, the equation for the Lorentz force 
(2.1.11) fixes the unit of B. In these units, the energy required to create a 
magnetic field B is given by (2.1.16). If we still want to use (877)! f | B(x)|?dx 
as the energy of the electromagnetic field in our new units, B//hc has to have 
dimension of length~*. From this it follows that the vector potential A//hc has 
the dimension of length~!. Thus, we choose 


re! Vhc as the unit for A 
re Vhie as the unit for B. 


The argument of both vector fields is the dimensionless quantity he te 

Our unit for B is unnaturally large, for it is 3.77 x 10° Tesla or 3.77 x 10!” 
Gauss, where 1 Gauss equals 1 gram!/* cm7!/? sec~!, which is of the order of 
the magnetic field on the surface of many neutron stars. By comparison, it is an 
achievement to produce 10 Tesla = 10° Gauss in the laboratory. 

Thus, for an atom with a single electron (i.e., a hydrogenic atom), our non- 
relativistic Hamiltonian (in units of mc”, with x being the length in units of the 
Compton wavelength and with p = —iV) becomes 


y-! 2_ Za 


; 2.1.63 
5? ix| ( ) 
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In these units (2.1.37) becomes 


1 Za 
ew= f (Guvmer-Ziverr)ar 2.1.64 
R3 
Similarly, the ‘relativistic’ Hamiltonian becomes 
Za 
H=Vp?+1-1- ie (2.1.65) 
x 


The kinetic energy (2.1.54), including the interaction with a magnetic field and 
the energy of this field, becomes 


1 1 
; y Iv +i Vaan)? dx + Va / (W(x), 0 » BOe)y(x)) dx 
R3 


R3 


1 
+= | 1Boprax. (2.1.66) 
Sir 
R3 


The corresponding Hamiltonian is 


1 
H==(p+ Va A(x))” + ve - B(x) + on i |B(x)|?dx. (2.1.67) 


R3 


Nile 


2.2 The Idea of Stability 


The expected value of the energy in a state w is E(w) and we can ask about its 
range of values. Obviously €(w) can be made arbitrarily large and positive, but 
can it be arbitrarily negative? (Recall that we are interested in normalized y, 
1.e., f |w|? = 1.) The answer in the classical case is yes! That is, the function 
H(x, p) is not bounded from below since —1/|x| can be arbitrarily negative 
and p can be zero. Any classical Coulomb system with point charges (in which 
the charges are not all positive or all negative) is an unlimited source of energy. 
In contrast, the lowest energy of an electron in a quantum-mechanical atom is 
finite — as we shall see. It is one Rydberg for hydrogen (Z = 1) and is of the 
order of 10* Rydbergs in large atoms where Z ~ 90. 

As mentioned before, one could object that the nucleus is not really a point 
particle and hence the Coulomb potential is not really an infinite energy source. 
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However, the radius of a nucleus is about 10~!° centimeters, which must be 
compared with the 10~® centimeter radius of the hydrogen atom, i.e., the Bohr 
radius. The distinction between a point nucleus and a 10~!> centimeter nucleus 
would be significant only if energies of the order of 10° times the energy of 
hydrogen played a significant role, which they do not. Such energies are effec- 
tively ‘infinite’ on the scale we are concerned with. The somewhat astonishing 
fact that an electron is forced to stay away from the nucleus (i.e., it is mostly to be 
found at the Bohr radius instead of the nuclear radius) is one of the most impor- 
tant features of quantum mechanics. Planck’s constant introduces a new length 
scale, the Bohr radius and, concomitantly, introduces a scale of energy. This 
scale of energy was not present in pre-quantum physics and it is one Rydberg. 

Thus the question is raised whether quantum mechanics sets lower bounds on 
the energy of electrons in atoms, i.e., is 


Ey = int} EH): f WxyPde = 1 (2.2.1) 
R3 


finite and, if so, what is its value? Note that if the infimum is a minimum (.e., 
Eo = E(w) for some Wo) then Ep is the lowest energy value the system can 
attain and is therefore called the ground state energy, and Wo is called a ground 
state. We will call Eo the ground state energy even if the infimum in (2.2.1) is 
not attained by any function wy. Indeed, it is always the case that a minimum is 
not attained if there are too many electrons in an atom (see Chapter 12). 

The finiteness of Eo is stability of the first kind. That quantum mechanics 
achieves this sort of stability is of great importance, for it resolves one of the 
crucial problems of classical physics, and it will be discussed extensively in the 
chapters to follow. 

Assuming for the moment that Ep in (2.2.1) is attained for some Wo, a simple 
variational calculation leads (see Chapter 11 in [118]) to the (stationary, or 
time-independent) Schrédinger equation for yo: 


Hw = Eowo. (2.2.2) 


In general, Ep is not the only value for which Eq. (2.2.2) has a solution. There 
are usually infinitely many of them and they are called the eigenvalues of H. 
(See [118, Sects. 11.5, 11.6].) They label all the stationary states of the atom and 
the difference between two eigenvalues determines the frequency of light which 
is emitted when an electron falls from a higher stationary state to a lower one. 
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In this book, we shall not be concerned with the question of whether Eo is an 
actual eigenvalue of the Hamiltonian H, except for Chapter 12 on the ionization 
problem. Even less will we be concerned with the existence of other stationary 
states. Rather we investigate whether Ey is bounded (for stability of the first 
kind) or bounded by the number of particles (for stability of the second kind, 
which will be defined in Chapter 3, Section 3.2). 


2.2.1 Uncertainty Principles: Domination of the Potential Energy by 
the Kinetic Energy 


Any inequality in which the kinetic energy Ty, dominates some kind of integral 
of w (but not involving Vy) is called an uncertainty principle. The histor- 
ical reason for this strange appellation is that such an inequality implies that 
one cannot make the potential energy very negative without also making the 
kinetic energy large, i.e., one cannot localize a particle simultaneously in both 
configuration and momentum space. The most famous uncertainty principle, 
historically, is Heisenberg’s: For yy € H'(R?) and ale =(v,Wv) = 1, 


dad 
(Wy, pw) = a xu) (2.2.3) 


The proof of this inequality (which uses the fact that V-x —x-V=dlIin 
R¢) can be found in many textbooks and we shall not give it here because 
(2.2.3) is not actually very useful. The quantity (y, x7w)~! is a poor indicator 
of the magnitude of Ty = (1/2)(y, p’w). If the particle is concentrated near 
the origin, then 7, is large, but w can easily be modified in an arbitrarily small 
way (in H'(R®) norm, i.e., in such a way that both the difference between 
the function and its modification and the gradient of this difference is small 
in L?(R¢) norm) so that (y, x?) becomes huge without (y, pw) becoming 
small. In other words, by a tiny modification of w, (w, x7y)~! can be made small 
even if Ty, is large. To see this, take any fixed function w and then replace it by 
Wwy(x) =v1—- 67 (x) + ew(x — y) with e < 1 and |y| > 1. To a very good 
approximation, Wy = w but, as |y| > on, ||Wyll2 > 1 and (Wy, x Wy) > Oo. 
Thus, the right side of (2.2.3) goes to zero as |y| — oo while Ty, ~ Ty does 
not go to zero. 

A much more useful inequality is Sobolev’s inequality. Recall the definition 
of the L?(IR7) norms 

I/p 


lvl, = / bye(x)|Pdx 
Rd 
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for 1 < p < oo. For p = ov, one uses ||P ||.o = sup, cpa |W(x)|. Any function 
vw whose gradient is in L7(R“) and that vanishes at infinity (meaning that 
the measure of the set where |y(x)| > w is finite for any > 0), with d > 3 
(these conditions are important), is automatically in LPR), although not 
necessarily in any other L?(R“) space. There is the inequality 


2Ty = (IV = / [Vw(x) dx > SallWllbaa—2) (2.2.4) 
Rd 


for some positive constants Sg. For d =3, the optimal constant is $3 = 
3 (427)?! 3. See [118, Chapter 8] for further information. 
For d = | and d = 2, on the other hand, we have 


2Ty > Solves? wie? for all2< p<oo, d=2 (2.2.5) 
2Ty = Ivy wis. d=1. (2.2.6) 


Moreover, when d = | and y € H'(R'), w is not only bounded but it is also 
continuous. 
An application of Hélder’s inequality, 


[ fener <\fllpliglg  forl<p<oo, p'+q'=1, 
Ra 


to (2.2.4) yields, for any potential V € L4“/?(R*), d > 3, 
Ty = SallWllgaya_2 = Sah IVIWIV Mla />- (2.2.7) 
An immediate application of (2.2.7) is that 
Ty + Vy = 0 (2.2.8) 


whenever || V ||a/2 < Sg, and thus stability of the first kind holds for such poten- 
tials V. 

A simple extension of (2.2.7) leads to a lower bound on the ground state 
energy for V € L4/2(R¢) + L©(R2), d > 3, i.e. for potentials V that can be 
written as 


V(x) = v(x) + w(x) (2.2.9) 


for some v € L4/?(R“) and w € L™(R®). There is then some constant A 
such that h(x) := —(v(x) — A)_ = min(v(x) — 4,0) < O satisfies ||A||a/2 < 
5 Sa (we leave this as an exercise for the reader). In particular by (2.2.7) 
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hy =(W, hy) = —5Ty. Then we have 


E(W) = Ty + Vy = Ty +(U- Ay HAF Wy = Ty thy +At+ wy 


1 
> 5 Ty +h — llwlleo (2.2.10) 


and we see that A — ||w||.. is a lower bound to Eo. Furthermore (2.2.10) implies 
that the total energy effectively bounds the kinetic energy, i.e., we have 


Ty < 2(E(W) — A+ ||wloo). (2.2.11) 


So far we have considered the non-relativistic kinetic energy, Ty = 
(1/2)(W, p*v). Similar inequalities hold for the relativistic case. Because 
|p| > / p? +m? —m = |p| —™, it suffices to consider the ultra-relativistic 
energy 7, = (W, |p|w). The relativistic analogues of (2.2.4)—(2.2.6) are (2.2.12) 
and (2.2.13) below. There are constants S’, for d > 2 and Si» for 2 < p < oo 
such that 


Ty = SiIW bana d>2 (2.2.12) 
and 
Ty = Sly? wi?/@- for all2 < p<oo, d=1. (2.2.13) 
For d = 3, S = (207)"/ [118, Thm. 8.4]. 


The results of this section can be summarized in the following statement. Jn 
all dimensions d > 1, the hypothesis that V is in the space 


LAP?) +L) itd = 3 
non-relativistic 4 L'+®(R2) + L©(R2) ifd=2 (2.2.14) 
Li(R')+Le(R!)—ifd=1 


_.., JLYRY+L°(R4)— ifd >2 
relativistic } (2.2.15) 
LiteR')+ LOR!) ifd=1 
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(for some € > 0) leads to the following two conclusions: 


Eo is finite 


Ty < CE(W) + DWI 


when w € H'(R@) (non-relativistic) ory € H'/?(R®%) (relativistic) and for suit- 
able constants C and D which depend only on V and not on wy. 


2.2.2 The Hydrogenic Atom 


The hydrogenic atom is the simplest conceptually and computationally. It has 
one electron and one nucleus with charge Z > 0 (the nucleus of real hydrogen 
in nature is the proton, which has Z = 1). If we assume that the nucleus is fixed 
at the origin x = 0, then the energy is given by (2.1.64) as 


1 Z 
E(W) = i (Gir weor - Sivoo?) dx. (2.2.16) 


|x | 
R3 


The Coulomb potential V(x) = —Za/|x| is in L*~*(R*) + L™(R’) for any 
€ > 0, but not in L3(R*) + L™©(R?). Hence the hydrogenic atom is stable non- 
relativistically as a consequence of Sobolev’s inequality, as discussed in the 
previous section. In fact, it is well known that 


1 >, Za (Za) 5 

[ Gewen = PTA ) ax = Sf wco dx, (2.2.17) 
R3 R3 

with equality if and only if w(x) = C exp(—Za|x|) for some constant C € 

C. In fact, this y can easily be shown to satisfy the Schrodinger equation 

(—A/2 — Za/|x|)w(x) = —(Za) W(x) /2, and since it is a positive function, it 

must be the eigenfunction corresponding to the lowest eigenvalue. (See [118, 


Section 11.10] for details; see also Corollary 3.1 in Section 3.2.4.) 
An equivalent formulation of the inequality in (2.2.17) is 


2 


2 
[ivwcpPax f ivonrar > {= dx (2.2.18) 
R3 R3 R3 
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for all functions yw ¢ H'(R*), which is obtained by choosing Za = 
Cf lwl?)-! fiw? /\x| in (2.2.17). This inequality will be useful later in 
Chapter 9. 

The analysis in the previous section shows that the relativistic case is 
borderline. That is, 1/|x| fails to be in L3(R*) + L©(R%), although it is in 
L?-*(R3) + L™®(R?) for any € > 0. We shall in fact see in Chapter 8 that there is 
a critical value of Za up to which there is stability, and beyond which stability 
fails. 


CHAPTER 3 


Many-Particle Systems and Stability of the 
Second Kind 


The main purpose of this book is to study the ground state energies of quantum- 
mechanical systems of particles interacting via electric and magnetic forces. 
The present chapter defines the mathematical framework of the problem. We 
shall define the kinds of wave functions that have to be considered and their 
permutation symmetries. The ground state energy will be defined as a variational 
problem, and the concept of stability of the second kind will be discussed. 


3.1 Many-Body Wave Functions 
3.1.1 The Space of Wave Functions 


A wave function for N spinless particles is any function y : R*% — C which 
is in L7(R°%) = @" L?(R) with unit norm, i.e., 


IWS = if IW(x4,...,Xy)|?dx,---dxy = 1. (3.1.1) 
R3N 


Here x; € R> is the (spatial) coordinate of the i” particle. Our notation is 
that subscripts label the particles, while particle coordinates are denoted by 
superscripts, e.g., X1 = (x{, aa x), It is convenient to denote a point in R*” by 


X :=(X1,...,XN) (3.1.2) 
and 
dX := dx,---dxy. (3.1.3) 


An important historical point is to be noted here. It might have been thought 
that the correct generalization for N particles is to use N functions of one variable 
instead of one function of N variables. Such a ‘wrong turn’ did not happen 
historically, which is, after all, remarkable. Nevertheless there are simple, but 
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interesting and useful functions of N variables that can be built out of N functions 
of one variable called determinantal functions and these will be discussed later. 
They are important in an approximation to quantum mechanics called Hartree 
and Hartree—Fock theories, which we will not discuss. 

The interpretation of y is that |w(X)|? = |W(x1, ..., x)|? is the probability 
density for finding particle | at x,, particle 2 at x2, etc. This interpretation 
makes sense in view of condition (3.1.1). In the case of one particle, one might 
be tempted to interpret |7/(x)|* as being proportional to a mass or charge density. 
In the N-particle case this interpretation is impossible since w is not a function 
of one variable. There is, however, a natural function of one variable associated 
with y, namely, for x € R?|! 


N 
oy @)= > e@ (3.1.4) 
i=1 
where 
Qy,(x) = / [We Min XK Hw) de + die; dey 
R3W-1 
(3.1.5) 


which, on account of (3.1.1), is the probability density of finding particle i at x. 
The notation dx; means that the integration over the i" coordinate is omitted. 
Obviously [ Qi, = |. Thus Q,,(x) is interpreted as the total electron density at the 
point x and is customarily called the single or one-particle density. It satisfies 


[ evenax =n. (3.1.6) 
R3 


If the i particle has electric charge e; the analogous charge density can be 
defined as 


N 
Oy(x) = D> Oi), (3.1.7) 
i=l 
where 


Oi,(x) = e10),(x). (3.1.8) 


' Note that @ is the particle density. It should not be confused with the mass density, which equals 
the particle mass times Q. 
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For non-relativistic quantum mechanics it is also required that y has finite 
kinetic energy, namely y is in H'(R>%), i.e., both y and each of the 3N com- 
ponents of Vw = (Vx, W,..., Wx, W) are functions in L?(R?%). The gradient is 
defined in the sense of distributions (see Chapter 6 in [1 18]). The kinetic energy 
w is then defined to be 


N 
T= oT (3.1.9) 
i=l 
where 
‘ 1 
Tj, =—— / (Vx, W(X) dX, (3.1.10) 
2m; 
R3NV 


and where m; is the mass of the i particle. Recall that for our choice of units, 
discussed in Section 2.1.7, the mass of an electron is 1. In the case of relativistic 
quantum mechanics 7 need only be in H'/?(IR3%), In this case the Ti, above are 


replaced by 
Ti, = i (Vani? +m? — m) [WK PdK (3.1.11) 


R3N 
(see Chapter 8 for more details), where K = (k,..., ky), and 
N 
PK) = | wXexp( - 2x1 Dox) -kj)dX 
R3N j=l 


denotes the Fourier transform of w. 
Accordingly, the potential energy of yw is defined to be 


ie / VXOIW(X)PaX, (3.1.12) 
IRIN 


where the function V(X) is the total potential energy function. In the case of a 
molecule, V(X) is given by (2.1.21)-(2.1.24). 


3.1.2 Spin 


In addition to the spatial coordinates, x, a particle may also have internal degrees 
of freedom. The most important of these is spin, but there could be others such 
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as isospin, flavor, etc. For the problems addressed in this book the precise nature 
of the internal states is not important and we shall, therefore, adopt the following 
convention: If a particle has q internal states available to it we shall say that the 
particle has q spin states and label them by the integer 


o €{1,2,...,q}. 


Electrons have two spin states and these are conventionally designated by 
o =+1/2 or —1/2. Here, o = 1 or 2 will be used, as stated above. From 
the mathematical point of view it is interesting to retain an arbitrary g in our 
formulation. This is due to the fact that the Pauli principle (explained below) 
introduces interesting dependencies of various quantities on g. More than that, 
by incorporating the g dependence explicitly we will be able pass from fermions 
to bosons, as will be explained in the next subsection. Bosons will be seen to 
correspond to g = N, the number of particles. For this reason alone it is worth 
retaining the dependence on q. 

Suppose there are N particles and the i particle has g; spin states. A wave 
function for these N particles can then be written as 


W(X1,01,...,XN, On) (3.1.13) 


where, for each choice of the sigmas (with 1 < 0; < q;), the function in (3.1.13) 
is an element of H!(R*%). Alternatively, yy can be viewed as an H!(R*%)- 


N 

function with values in C2 with Q := J] qj. (This means simply that y can be 
i=1 

thought of as Q functions in H 'aR?%), with o1,..., an furnishing the labeling 

of these functions.) The set of these C2-valued functions will be denoted by 
N 

H'(R?%; C2), and it is isomorphic to @) H'(R?; C%). In analogy with (3.1.2) 


i=1 


the spins are collectively denoted by 
Oo := (0],..., On) 
and 
qN 
oj=1 on=1 


With this notation (3.1.13) can be conveniently written as w(X, co). Another 
convenient notation is 


Zi =(%;,0;) and Z=(Z%,...,ZN) (3.1.14) 
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and, for any function f(x;, 0;), 


qi 
[ favs, = » | fe car; (3.1.15) 


di=lp3 


Using the above notation, (3.1.13) will sometimes be written as 


W(Z1,.--,2n) or wi). 


The replacement of the normalization condition (3.1.1) is 


Wwik= of wa.oPax = f weeds =. 


fo IRB 


3.1.3 Bosons and Fermions (The Pauli Exclusion Principle) 


As explained in the previous section, the coordinate of a particle is a point in 
R? x {1,...,g},ie., a pointx € IR? and apointo € {1,...,q}. The value of g 
is a fixed property of the kind of particle, i.e., whether the particle is an electron, 
proton or whatever. Recall the notation z; = (x ;, 0;) for the j 'h particle. 

It is a basic postulate of quantum mechanics that the allowed wave functions 
for a given particle species must belong to some definite permutation symmetry 
type which is characteristic of that species. There are two kinds of particles in 
nature, called bosons and fermions. Bosons are characterized by the fact that 
a wave function describing a system containing several identical bosons, e.g., 
positively charged pions, must be totally symmetric with respect to exchange 
of any pair of boson coordinates. That is, if z; and z; are the space-spin variables 
of two identical bosons, 


Wve Rippon ey asa) = Wise Rh ong Biche): 


Fermions, on the other hand, demand total antisymmetry of the wave functions. 
That is, 


cl Geer <perreee A eee eel Ceres a eee ae 


if z; and z; are the coordinates of identical fermions. The antisymmetry property 
is usually rephrased by saying that fermions obey the Pauli exclusion principle. 

Terminology: It is common in the physics literature to refer to the choice of 
symmetry type as the statistics of the particles or of their wave function. More 
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exactly, the requirement of symmetry under coordinate exchange is referred to 
as Bose-Einstein statistics, named after S. N. Bose who analyzed the states 
of photons (quantized particles of light) and A. Einstein who applied Bose’s 
method to ordinary massive particles. The antisymmetry requirement is referred 
to as Fermi—Dirac statistics, after E. Fermi and P. A. M. Dirac. 

In general, one considers several species of particles simultaneously. There 
is just one wave function describing such a system, and it has to be symmetric 
with respect to exchange of particle coordinates corresponding to the same 
species of bosons, and antisymmetric for particles corresponding to the same 
species of fermions. For example, a wave function describing both positively and 
negatively charged pions (which are bosons) has to be symmetric separately in 
the coordinates corresponding to the positively and negatively charged particles, 
but there is no symmetry requirement for exchange of coordinates corresponding 
to different species. 

In the following chapters, we will mostly be concerned with wave func- 
tions describing electrons and different kinds of nuclei. While the electrons 
are fermions (with g = 2), nuclei can be either fermions or bosons. Nuclei of 
different charges are, in any case, independent, i.e., they belong to different 
species. 

A simple example of a symmetric function of N variables is yw(z) = 
ies u(z;), where u is any function in L?(R?; C2) of one variable, z. 

The simplest example of an antisymmetric function is obtained by taking 
any N functions u;(z) € L7(R?;C%), i =1,..., N that are orthonormal, i.e., 
f uj(Z)u j(z)dz = 6; ;. The N-particle wave function 


w(z) = (NI? detfui(z <1 (3.1.16) 


is antisymmetric and normalized. It is called a determinantal function, or 
Slater determinant. The subspace of L7(R°%; C7") consisting of all antisym- 
metric functions is denoted by a L?(R?; C4), where /\ stands for the anti- 
symmetric tensor product. If {w;(z)}°, forms a basis for L?(R?; C2) then all 
the determinantal functions given by (3.1.16), using all possible choices of N of 
the functions u;, form an orthonormal basis for al L?(R?3; C4). 

Another fact of nature, which can only be explained using relativistic quantum 
field theory, is that bosons have an odd number q of spin states (one says the 

g-1 


spin, defined to be a is an integer) and the fermions have q even, i.e., the 


spin a is 1/2, 3/2, etc. This will not be of any importance in our discussion, 
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however, because this restriction is not mathematically natural in our context. 
We shall consider all integers g in this book. In the literature one sometimes 
comes across academic g = 1 fermions, which are called spinless fermions. 
One can think of this as a situation in which the particles are g = 2 fermions, 
but the wave functions are restricted to be those in which w = 0 unless all o; 
are 1. In treating bosons here we shall only consider g = 1, but everything can 
be easily generalized to arbitrary q. 


3.1.3.1 The case g > N 


Suppose we have a Hamiltonian H that does not depend on spin. That is to 
say, it acts on L?(R3%), for example the Coulomb Hamiltonian, which will be 
discussed in Section 3.2. This Hamiltonian can be extended in a trivial way to 
H @ 1, where I is the identity on the spin space C4 "It acts on Dae Ce oe) — 
L?(R3%) Q cq 2 

Consider now the case where g > N, N being the number of particles. For 
fermions with q spin states, the physical wave functions are totally antisymmetric 
functions in L7(R3%; C4 ny, We shall now show that for g > N, this fermionic 
system is equivalent to considering g = 1, i.e., the original Hamiltonian H on 
L?(R3%), without any symmetry restrictions on the wave functions, and without 
any spin degrees of freedom. This can be seen as follows. 

Take any function $(x1,..., x) of N space variables, and let 


1 
W(Z1,---,2Zu)= Ta oe Xn) filoi): ++ fron), 


where f;(o) = 1 foro =i and 0 otherwise, and where A denotes total antisym- 
metrization. That is, 


Alx(Z1,.--,Zw)] = 2 ExX(Zn(1)s ++ +s Zx(N))s (3.1.17) 

mwESN 
where the sum runs over all permutations of the numbers 1,..., NV, and é, = 1 
for even permutations, and ¢, = —1 for odd permutations. The function y is 


clearly an antisymmetric function in L7(R*"; C4 "), and it is not identically zero 
if @ is not. In fact, the norm of w is the same as the norm of @. This follows 
from the fact that the functions f; form a set of orthonormal functions in C?. 


2 Note that @” C4 = C” , not C2", ie., the dimension is q™ not qN. 
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Moreover, for the same reason 


(vy, H @Ilv)=(@, H¢) 


if H is invariant under permutations. In particular, since @ was arbitrary, the infi- 
mum of (W, H @ Ivy) over all antisymmetric w equals the infimum of (¢, H¢) 
over all @. We shall use this fact later when discussing stability of bosonic 
systems. 

We note that if H is permutation invariant, the @ that yields the lowest value 
of (¢, Hd) among all possible (normalized) functions can be taken to have a 
definite symmetry type without loss of generality, just as an energy minimizing 
state for a rotation invariant H can be taken to have a definite angular momentum. 
‘Symmetry type’ refers to representations of the permutation group, i.e., to 
the behavior of a function under permutations, and symmetric (i.e., bosonic) 
and antisymmetric (i.e., fermionic) are but two examples of symmetry types. 
There are other symmetry types than symmetric or antisymmetric but these 
will not be used in this book. (For more information on representations of 
the permutation group, we refer to any textbook on finite groups, e.g., [84].) 
The energy minimizer need not be bosonic or fermionic, however. Indeed, for 
systems with magnetic fields or systems under rotation it is known that the 
energy minimizer is, in general, neither bosonic nor fermionic [162]. In the 
absence of magnetic fields, the minimizer can be shown to be always bosonic; 
we will show this in Corollary 3.1 on page 59. 


3.1.4 Density Matrices 


N 
Given a normalized function y € L?(R?*%; ca") = @ L?(R?; C4) there is nat- 
urally associated with it an orthogonal projection Py on L7(R*%; C4 o) whose 
action on a function @ € L7(R°*%; C2") is given by 


(Py AZ) = , IW@ (3.1.18) 


with (Ww, @) = f W@)b@)dz. Clearly [yy = Ty and ly is self-adjoint, i.e., 
(¢', 'y¢) = Ty ¢’, @) for all ¢ and ¢’. This Ty has two other important 


3 A notation that is often used is Dirac’s bra and ket notation ly =|wW)(w|. In this notation the 
inner product is given by (|) instead of (yy, ), and the matrix elements of an operator H are 
denoted by (y|H|@) instead of (Ww, H@). 
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properties: 
(¢, Ty) =0_ positive semidefinite, 
Trly, =1 unit trace. (3.1.19) 


The symbol Tr denotes the trace, which can be computed in any orthonormal 
basis {¢y} as >> (da, Pyba). By choosing ¢; = w and the other dg to be 
orthogonal to w, Eq. (3.1.19) is easily deduced. 

The projection I'y, has an additional property. It has one eigenvector with 
eigenvalue 1, namely y itself, [yw = y, and all other eigenvalues are zero. It 
is easy to prove that any linear, self-adjoint operator I that is positive semidefinite 
and has unit trace, and has one eigenvalue equal to | has the form (3.1.18). The y 
appearing in (3.1.18) is the unit eigenvector of I, i.e., the w such that "y = y. 

Any operator of the form (3.1.18) is called a pure state density matrix. A 
general density matrix is any linear, self-adjoint positive semidefinite operator 
with unit trace — the condition that 1 is an eigenvalue is dropped. In some sense 
such an operator can be viewed as a generalization of the concept of a wave 
function; this generalization is not only useful but it is actually forced upon us, 
as will be seen shortly. 

The two conditions (3.1.19) on a density matrix I" will be written as 


0<I <I, Trl = 1, (3.1.20) 
where [is the unit operator, I w = w forall y. The condition < I, whichis an 
easy consequence of Tr I’ = 1 and the condition > 0, means that (w, lw) < 


(w, w) for all yw. Condition (3.1.20) is a far stronger statement than simply that 
I is a bounded operator. In fact (3.1.20) implies the eigenfunction expansion 


CO 
r= Shits (3.1.21) 
j=l 


where the functions y; are an orthonormal basis for L7(R3; C4 ") and the real 
numbers A, which are the eigenvalues of I’, satisfy 


Aq = dig 2 wee), a=. 


Also, Pw; =A; W; for each j. In other words, I is a convex combination of 
pure state density matrices. The proof of (3.1.21) is left to the reader (or see, 
e.g., [168, Thm. 1.4]). 
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We say that T is an H! density matrix if each w; in (3.1.21) is in 
H'(R°%;, C1) and if 


CO 
SAV Y{llZ < 00, 


j=l 


with 4; being the eigenvalues of I’. Note that this is a stronger condition than 
merely assuming that maps L7cR?* :C4") into H'(R3% C4"); this weaker 
condition is yo IVY Alb < oo. Analogous definitions can be made for 
H'/? density matrices. 

Equation (3.1.21) also implies that I has a kernel function (which we also 
denote by I), i.e., 


(Ty) = / T(z, z)y(z')dz’. (3.1.22) 


The self-adjointness implies that T(z, z’) = I'(z’, z). To calculate I'(z, z’) we 
first note that in the simple case of a pure state I"y, the kernel is obviously just 
W(z)w(z’). Then from (3.1.20) and an easy proof using the dominated conver- 
gence theorem to exchange limits and integrals, the following representation of 
T(z, z’) can be obtained: 


(zz) = > Aww j@). (3.1.23) 


j=l 


There is a technical point that has to be noted. Equation (3.1.22) does not define 
the kernel function on the diagonal where z’ = zZ, i.e., '(z, z) is undefined for all 
z = (X, a). The reason is that the set of points (X, X) in R3% x R? is merely 
a set of 6N-dimensional Lebesgue measure zero. Therefore, the ‘well known’ 
equation 


Trl = fre. z)dz (3.1.24) 


does not follow from (3.1.22). However, if we agree to define T(z, z) by (3.1.23), 
namely 


TE. 2) = Dd Aly @lP (3.1.25) 


j=l 
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then (3.1.24) is true because 


(oe) 
T= Ay 
j=l 


and each y; is normalized. Note that (3.1.25) defines ['(z,z) only for 
almost every z, but that is sufficient for (3.1.24). The function I'(z, z) on 
(R? x {1,..., q})" defined using (3.1.25) is called the diagonal part of the 
density matrix I. 

The eigenfunction expansion (3.1.21) is also important for defining energies 
and expectation values of certain operators. In Sections 3.2.1 and 3.2.2 we shall 
define, for any function y € H'(R*%; C2"), an energy €(y) which is a quadratic 
form in y and to which is formally associated an operator H (the Hamiltonian), 
ie., E(w) = (Ww, Hy) in the sense discussed in (2.1.36). We now use (3.1.21) to 
define the traces 


Tr AT =TrPH := 9) AajE(y) =: ED). (3.1.26) 


j=l 


For unbounded operators H the expressions Tr HI and TrI°A are not well 
defined a priori but the right side of (3.1.26) is perfectly well defined (provided, 
of course, that VAIEMAL < oo.) The same notation (3.1.26) can be used to 
define Tr AT = TrIT’A for any operator, A, formally associated with a quadratic 
form A(w). 


It is an obvious, but important consequence of (3.1.26) that 
inf{E(w) : (WwW, W) = 1} = inf{E(L) : T is a density matrix}. 


One can speak of I’ belonging to a certain permutation symmetry type. It 
simply means that each y; in (3.1.21) belongs to this type. In Section 3.1.3 it 
was stated that the only physically relevant permutation types are the totally 
symmetric (bosonic) or totally antisymmetric (fermionic) ones for each particle 
species, and it will be assumed henceforth, unless otherwise stated, that T° 
belongs to a physical symmetry type. 


3.1.5 Reduced Density Matrices 


Assume, for simplicity, that the system under consideration contains only one 
species of particles, either fermions or bosons. The generalization of the follow- 
ing concepts to multiple species is obvious and left to the reader. 
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If T is a physical (bosonic or fermionic) N-particle density matrix with kernel 
T(z, z’) and if 1 < k < N we can define the kernel of the k-particle reduced 
density matrix to be 


N! 
(k) ae) i. ) 
Zigoeee DES Sa goes Kp) See PD is ee Ze Sg es EWG Za oes 
v1 eal a) wep | Pe pete N32] 
ie fe eee a8 4 eee ae 
(3.1.27) 


Remarks 3.1. (1) The right side of (3.1.27) is formal (for the same reason 
that the right side of (3.1.24) is formal) but it acquires a unique meaning if the 
eigenfunction expansion (3.1.23) is used. 

(2) Note that y is not normalized to have trace equal to one. Normalization 
conventions other than N!/(N — k)! are often used in the literature. 

(3) In (3.1.27) the variables Z,41,...,Zy are integrated out. Because I" is 
bosonic or fermionic any other N — k variables could have been chosen. Also, 
the ordering of 1, 2,..., k is arbitrary. Indeed, the correct definition of y® when 
I’ is neither bosonic nor fermionic is to take all these possibilities and add them 
together with a weight 1 instead of N!/(N — k)!. When I is totally symmetric 
or totally antisymmetric all these choices give the same result, and that is why 
(3.1.27) is true in this physical case with the factor N!/(N — k)!. In the case 
of several species of particles it is obvious how to define the reduced density 
matrices for each species — or for a mixture of several species. In the interest 
of keeping the notation simple, we shall resist the temptation to write down the 
most general formula in the following treatment. 


Example: For a determinantal wave function (3.1.16) the k-particle reduced 
density matrix is expressed in terms of the u1,..., uy by 


YOR 5 oes BZ yy ve Ze) = ke! Ta — = det | {uz,(z; yy det{u,,(z/)) yi jet 


i,j= oe 
(3.1.28) 


The sum runs over the (7 ) choices, denoted by t, of k functions from among 
the N given ones. In particular, the one-particle density matrix equals 


N 
YE, 2) = D> ui ui’). 


i=1 
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For use later, we shall also write down the diagonal part of the two-particle 
density matrix, also called the two-particle density, of a determinantal wave 
function. It is given by 


yO (Z1, 225 215 22) = VOR, ZV (Za, 22) — [YZ Za)?» B.1..29) 


Associated with the kernel y is the operator y“, which acts on functions 
of k variables in the obvious way by integration, L.e., 


(YW) Greet) = fyi zee BOWE zpdel de. 


It is called the k-particle reduced density matrix. It can also be written in 
terms of the partial trace, Tr“, as 


! 
@  _N' _q,a-op, 


Y= (Wb! 


For our purposes, Tr“ can be thought of simply as a mnemonic device for 
(3.1.27), with the relation between kernel and operator given by (3.1.22). 

As an operator, y“ has all the properties of a k-particle density matrix except 
for the normalization 


To justify this statement it is necessary to verify that y is positive semidefinite. 
For any ¢ € L?(R*; Cc"), and with Z6 := (Z1,...,Zx), we have 


(d, y¢) = [PERM YR, z'\dzqz 


Sa 2 
= Ay / | | Ozh (2, z4—)dz] dze*—), (3.1.30) 
j=l 


and this is non-negative. (The reader is invited to prove, using the Schwarz 
inequality, that these integrands are summable and that the interchange of the 
order of summation and various integrations is justified by Fubini’s theorem.) 

As aconsequence of being self adjoint and trace class, the kernel y also has 
an eigenfunction expansion 


yO, 2) = AP HOA), (3.1.31) 


j=l 
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with a > 0, fj € L7(R*; Ca") orthonormal, and 


j=l 


We also note that Wy is an H! density matrix if I is one. 


The expansion (3.1.31) permits us to define y“ on the diagonal, i.e., 
y®, z®) in analogy with (3.1.25). In particular, it should be noted that 
the one-particle density oy, defined in (3.1.4) is the diagonal part of y“. 

An important application of partial traces is the definition of the spin-summed 
density matrix. Consider the case of the one-particle density matrix y“, 
which is a positive trace class operator on the one-particle space L?(R?; C4) = 
L?(R*) ® C‘. By taking the partial trace over the C/ part, one obtains a positive 
trace class operator on L7(IR*), which we shall denote as 7’. Explicitly, in terms 
of kernels, 


q 
Vy Ga = ) 7G eix a): (3.1.32) 
o=1 
The trace of y“!) on L7(IR3) is the same as the trace of y“ on L?(R*; C2), namely 
te y(x, x)dx. The largest eigenvalue of 7“) may be larger than the largest 
eigenvalue of y“!; in fact, it is at most qg times as big. We denote the largest 
eigenvalue of y" and 7 by ||v IIo and ||? |l.o, respectively. We thus have 


hee = LY hess (GA33) 


a fact that will be useful later. 

For the purpose of this book the most important of the y™ are y“ and, to 
a much lesser extent, y. The Hamiltonians H, or energy functionals €(-) we 
shall consider in this book will have only one- or two-body terms in them. That 
is, formally 


N 
H=Johi+ D> Wy. 
i=l I<i<j<N 
This notation is meant to suggest that / is an operator (or quadratic form) on 
L?(R?; C4), e.g.,h = —A + V(x), and >> ;h; is simply )>,[—Ax, + V(x;)]. For 
the energy €(-), this translates into [(°;|Vx, Wl? + 0; V(xi)|w?)dz. Similarly, 
W is an operator on L*(R*;C%) @ L?(R?; C4). We then have the important 
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formula 
1 
E(V) = Tr AT =Trhy? + it Wy. (3.1.34) 


Again, the traces in (3.1.34) are defined by using the eigenfunction expansions 
(3.1.31) for y and y®.4 

Formula (3.1.34) raises a tantalizing possibility about computing the mini- 
mum (or infimum) of €(I°). Instead of considering all possible N-particle density 
matrices I’, it suffices instead to consider all possible two-particle density matri- 
ces y). Trivially y“” can be obtained from y™ by 


1 
() Tey 
i 7s ae 


+ Some readers might be concerned about the following technical point. We have defined Tr HT 
via the eigenfunction expansion of I’, and similarly we define Trhy™ and Tr Wy® via the 
eigenfunction expansions of y“ and y. It remains to check that these different expansions 
give the same answer, and that (3.1.34) holds. For example, ify“ = )*, «7, is the one-particle 
density matrix of [ = am AqTw,, is it true that 


do Kilwi@P = Soe fetes 22s zw Pde + de 


for almost every z = (x, 0) € R* x C?? The answer is yes, and can easily be seen as follows. 
If V is a bounded, measurable function on R? x C4, 


(ui, Vy ui) 
= ye / ui (Z)u;(2")V (2) Wa, 2, «++ Zv)WalZ', Za, «++ Zw) dz dz’ dzy + -- dew. 
For almost every Z2,..., Zy, we have 


» / Uj (z)uj(z')V (Z)o(Z, 22, +++, Zw) WalZ’, Z2,---, 2) dz dz’ 


= f Veale, 22... 20) Pde 
and hence 


Ds f vq Pde = Ten, VM w= Tra fl Vales 220+ zn de dea dew, 


Fubini’s theorem justifies the interchange of integrals and sums. Since this is true for every 
(bounded) V we arrive at the statement above. A similar argument applies to the two-particle 
density and to the kinetic energy in (3.1.34). 
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so that Tr HT really only depends on y. The difficulty is that we do not know 
how to characterize the set of two-particle density matrices y that arise as 
the reduction of some I’. If we did, then the N-body problem would effec- 
tively be reduced to a two-body problem. It is definitely not the case that every 
self-adjoint positive semidefinite operator, y, on L?(R?; C2) @ L?(R?; C2) that 
satisfies Tr y = N(N — 1) arises from some I" with the right symmetry. Some 
useful necessary conditions are known (see, e.g. [78, 147, 31]) but a non-trivial 
sufficient condition is not known. The quest for necessary and sufficient condi- 
tions has been a topic of research for many years — called the N-representability 
problem. We shall not mention it further here. 

The only k for which the allowed k-particle reduced density matrices y“ 
can be characterized simply is k = 1. A positive semidefinite operator y on the 
one-particle space L?(R*;C%) with Tr y = N will be called admissible if it is 
the one-particle reduction of some density matrix I’ on the N-particle space. 
The boson and fermion cases are very different and we give them separately. In 
the boson case all such operators y are admissible, but not all y are admissible 
in the fermion case. 


Theorem 3.1 (Admissible One-Body Density Matrices for Bosons). Let y be 
a self-adjoint, positive semidefinite operator on L*(R?; C4) with finite trace 
Try=N 


for some integer N > 1. Then there is a bosonic N-particle density matrix T 
such that y = NTr ‘(N-DT. Moreover, if N > 2, T can be chosen to be a pure 
state, i.e., T = ly for some w. 


Proof. If N = 1 we simply take ' = y, so suppose N > 2. Let { fj, A;}§2, be 
the orthonormal eigenfunctions and eigenvalues of y. With the choice 


lee) N 
wg) = NOP SAP TT Fi 
j=l i=] 


one easily checks that y is normalized and that y = NTr‘% a Boe | 


The following is the analogue of Theorem 3.1 for fermions. It is due to 
Coleman [31]. The conclusion (3.1.35) will be very important for us. 


Theorem 3.2 (Admissible One-Body Density Matrices for Fermions). Let 
y satisfy the hypotheses of Theorem 3.1. Then there is a fermionic N-particle 
density matrix T such that y = NTr““~ YT ifand only if y satisfies the additional 
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condition 


y <i. (3.1.35) 


It is not true, in general, that TY can be chosen to be a pure state. In particular, 
whenever y has N —1 eigenvalues equal to I and at least N + 1 positive 
eigenvalues then V' cannot be pure. 


Proof. We first prove that (3.1.35) suffices to insure the existence of a I’. Let 
A, > Az =>... denote the eigenvalues of y in decreasing order. By assumption, 
0 <A; <1 and ye Ay = N. We can assume that Ay+; > 0, otherwise a 
simple Slater determinant will satisfy the requirements. Note also that Aw+1 < 
N/(N +1) <1, since N > DA Ay = (NF Danas 

Let x11, be the characteristic function of [1, N], Le., xu,4j](J) = 1 for 1 < 
Jj < Nand xp.yiVj/) = Ofor 7 > N. Withe = min{Ay, 1 — Aw41}, we can write 


Aj = exp) + — 8) fj 
with 
1 : 
fi= ae (Aj — xn.) - 


Then 0 < f; < 1, and i Pak. 
We can now rearrange the sequence f; in decreasing order, and repeat the 
construction. After M iterations, we see that 4; can be written as 


M 
. M 
Ay = do cexe) + RM, 
k=1 
where x; is the characteristic function of some subset of N points in {1,2,...}, 
and eae cr < 1. In fact, if ¢, denotes the value of ¢ in the k’” iteration, we 
have 


k-1 
Ck = Ek [a _ &;). 
j=1 
M 


We can now distinguish two cases. If )-P°., cx = 1, then limy—oo ee Rv = 
0 and hence 


(oe) 
p= >. Gx): 
k=1 
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In particular, y can be written as a convex combination of rank N projections. 
If we take I" to be the same convex combination of the rank 1 projections onto 
the Slater determinants corresponding to the rank N projections, I" will satisfy 
all the requirements. 

It remains to consider the case )~7°, c, =: d < 1. The sequence Ri con- 
verges strongly to some sequence R; as M — oo, with the property that O< 
Rj <1—dand)" >, Rj = NU — d). That is, limy—oo 72 [Ri — Rj| = 0. 
This follows easily from the dominated convergence theorem. For the following 
reason one Ny concludes that limy_,.. €, > 0. Let us reorder the pele 
Rj ae a y pa) Ck) age! order and call the resulting sequence r™” ;_ - Since 
0 - ca < sole a oa = WN, we conclude that rv, , < < N/(N + 1), as we noted 
cari. Then 1 — ri +1) > 1/(N + 1), which implies that e, can only go to zero if 
r\! goes to zero. But ry converges, as M — oo, tor;, which equals the sequence 
R;/(. — d) rearranged in decreasing order. The numbers r; are ordered, lie 
between 0 and 1, andsum to N. Hence ry > 0. This implies that limy_,4 €, > 0. 

For arbitrary numbers 0 < : < land M E NU {oo}, 

M 

sa) Ta —ej)=1 -[le =A); (3.1.36) 

k=1 j=l 
A simple proof of this identity (due to Ya. eae is to rewrite it as a telescopic 
sum of the form ace — a.) = ao — ay, where a, = ac — ¢;) for 
k > 1 and a = 1. We apply this to our situation above and note that the left 
side equals d when M = ov. The right side equals 1, however, if the e, are not 
summable which, in particular, is the case if limy.. €, > 0. This contradicts 
the assumption that d < 1, and proves that (3.1.35) is indeed sufficient for 
N-representability. 


Remark 3.2. What we have really just shown is that every sequence of numbers 
A; with 0 < A; < 1 and a 4; = N can be written as a convex combination 
of characteristic functions of N elements. Since the latter are the extreme points 
in this convex set, this fact follows from a general result known as Choquet’s 
theorem. In finite dimensions, it goes back to Caratheodory and Minkowski [151, 
Sect. 17]. For the infinite dimensional version, we refer the interested reader to 
[166, Sect. I.5]. Instead of using this abstract result, however, we chose to give 
the elementary explicit construction above. 


We shall now prove the converse, namely that y = NTr‘Y~)T implies 
(3.1.35), ie. (6, yb) < (@, ¢) for all @ € L7(R?; C2). 
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First, let us assume that I" is pure, ie, [=Ty with (Wy, y)=1. 
Define the annihilation operator Cy, : Hy = /\" L?(R°;C%) > Hy-1 = 
AN“ L2(R%; C4) by 


(Cy.oW)(Z1,-..,%N—-1) = Ni? i W(z1,...,Zn)@(zw)dzy. (3.1.37) 


Clearly, 


(Cy.oV, Cn.6W)xHy_. = (Gv). (3.1.38) 


If x is any function in 7/,_, then one easily checks that 
(x, Cw.p Wnty = (ChgXs Wry 


with c| ¢ | Hn-1 — Hy being the creation operator given by 


(Ch yOG1. 520) = NPAs Zw OR}. — B.1.39) 


Here, A is the antisymmetrizer (3.1.17). A simple algebraic exercise shows that 


CyseCharg + Ch glue = @, Oly (3.1.40) 


where Ix is the identity on 71). Thus, returning to (3.1.38), 


(6,79) =(W, Ch gv gWrty = @ OA, Wty — Ws Cw oC has gh ry 
= (6,9) — (Chi gts ChargW) < @.). (3.1.41) 


This is the desired goal. If is not pure, we can apply (3.1.41) to each term in 
(3.1.21) and then use iA =1. 

To prove the last sentence of the theorem it suffices to find a y satisfying 
(3.1.35) but such that no pure ['y exists for which y = NTr a) We First, 
suppose N = 2 and that y“ has three non-zero eigenvalues and eigenfunc- 
tions (1, f), (uw, g), (A — pw), h) with 0 < yw < 1 and with f, g, h orthonormal 
functions in L?(R*;C%). All other eigenvalues are zero. Assume now that 
y‘? is the reduction of some I',. If we take ¢ = f in (3.1.41) then, since 
(fy? f) =, f) = 1, it follows from (3.1.41) that a ;v = 0. By (3.1.40), 


Ch Cph =v. (3.1.42) 
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The following explicit form of (3.1.42) is a consequence of the definitions 
(3.1.37) and (3.1.39): 


A { re) Wi, £1) Fades | = W(Z1, 22). (3.1.43) 


Let 2~!/? f:(z1) denote the integral in (3.1.43). Since (f, f) = 1, it follows that 
(fo, fo) = 1. Then (3.1.43) reads 


W(R1, 22) = 27? detl fi DH jar (3.1.44) 


with f; = f. Moreover, 


He p= / W(Z1, 22) f (21) f (Z2)dz1dz2, 


and this vanishes because f(z) f(Z2) is symmetric and y is antisymmetric; thus 
(fi, f2) = 0 and (3.1.44) is correctly normalized. From (3.1.44), we find that 
y has only two non-zero eigenvalues and eigenfunctions (1, f;) and (1, fs), 
which contradicts the initial assumption of eigenvalues 1, w, 1 — pw. 

The same idea can easily be generalized to show that for any N, a pure state 
Ty, cannot give rise toa y“” that has (N — 1) eigenvalues | and all the remaining 
eigenvalues strictly less than 1. | 
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In this section, we will describe in detail some of the many-body Hamiltonians 
whose ground state energy will be studied in this book. We also define what is 
meant by stability of the second kind. We start with the case of static nuclei, and 
consider the case of dynamic nuclei in Subsect. 3.2.2. 


3.2.1 Many-Body Hamiltonians and Stability: Models with 


Static Nuclei 
The Coulomb Hamiltonian for N non-relativistic electrons interacting through 
electrostatic forces with M static nuclei fixed at positions Ri,..., Ry € R? is 
N 


1 
Hy.u = —5 Ai + aVc(X, R). (3.2.1) 


i=1 
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The function Vc(X, R) is the total Coulomb energy defined in Section 2.1.4, 
Eas. (2.1.21)-(2.1.24) and a := e? /hc © 1/137 is the fine-structure constant. 
The units are chosen as explained in Section 2.1.7. Note that the only free 
parameters besides N and M are the nuclear charge numbers Z),..., Zy. 
Although they are integers in nature, we shall not assume that here. 

A simple but important observation is that (3.2.1) is symmetric under permu- 
tation of the electron labels. Hence it makes sense to restrict it to antisymmetric 
wave functions, as appropriate for electrons. For yy € Ke 1°’; C4) a totally 
antisymmetric wave function of space-spin variables, we write the quadratic 
form associated with Hy, as 


N 
Ev() = (Ww, Hy.mW) = D5 Tj, taVy (3.22) 
i=l 


(where Ty, and V,, are defined in (3.1.10) and (3.1.12), respectively). Recall that 
Z; = (x;,0;) where x; € R° and the spin index o; takes values in {1,..., q}. 
In the case of electrons, g = 2 of course. If electrons were bosons we would 
then consider (3.2.2) with w(z1, ..., Zy) Symmetric. The minimization problem 
associated with (3.2.2) is now 


Ey(Z, R) := inf{Ey(W) : w bosonic or fermionic, ||W||2 = 1, y € H'(R*%)} 
(3.2.3) 


with the obvious notation Z = (Z,,..., Zy)and R = (Ri,..., Ry). En(Z, R) 
is called the ground state energy. Note that for this problem the positions of 
the nuclei are fixed parameters and therefore the repulsion U is unimportant for 
the calculation of Ey(Z, R), although it is crucial for questions such as binding 
and stability of the second kind. 

Thecondition Ey(Z, R) > —oc for all distinct values of R,,..., Ry iscalled 
stability of the first kind. For non-relativistic Coulomb systems this kind of 
stability is not very hard to prove. To see this, simply omit the positive repulsive 
parts. The attractive parts can be studied one particle at a time; they satisfy the 
stability conditions in Section 2.2.1, and hence stability of the first kind follows. 
The estimate for Ey(Z, R) obtained by this simple argument will depend on the 
nuclear coordinates R. Our goal is to eliminate this dependence by defining the 
absolute ground state energy to be 


Ey,m(Z) := inf {Ey(Z, R): Re R*’}. (3.2.4) 
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With it we can define stability of the second kind, which says that 


Ey ,m(Z) = —&(Z)(N + M) (3.2.5) 


for some number (Z) that depends only on Z := max(Z),..., Z). Obviously, 
(3.2.5) is much harder to prove than stability of the first kind, for UR) now plays 
a decisive role. (Without U(R), the minimum energy would occur at Ry = 0 for 
all k, and would hence be proportional to —N(}°, Z,)° ~ —NM?.) As we shall 
see, stability of the second kind holds only for fermions. 

As we mentioned in the prologue, the linear lower bound (3.2.5) is essential 
for our understanding of ordinary matter, where a full glass of water has twice 
the energy of a half-filled glass. Were the energy to grow by a larger power of 
the number of particles, one could extract a huge amount of energy simply by 
pouring one half-filled glass of water into another. 

The relativistic analogue of (3.2.1) is given formally by 


N 
Hey =~ (/=A; ey 2 1) + aVc(X, R). (3.2.6) 


i=1 


The symbol ./—A; + 1 — 1 is defined by considering the quadratic form asso- 
ciated with (3.2.6): 


N 


EN) =, Ayu) = DT, +aVy, 


i=1 


where T;, is now defined in (3.1.11). Analogously to (3.2.3) we can define the 
ground state energy of EX'(y) to be 


EX'\(Z, R) = inf{EX'(W) : w bosonic or fermionic, || vl = 1, w € H'/7(R°%)}. 


It is by no means clear that EX\(Z , R) is finite. In fact, as we shall see in 
Chapter 8, even for the one electron, one nucleus case, Ew\(Z) is only finite 
if Za <2/m. From this we would expect that, provided Z;a < 2/z for all 
i=1,...,™M and provided that the nuclei stay fixed and apart, stability of 
the first kind should hold. In fact this is true (see Lemma 8.3 in Chapter 8) 
independent of the statistics, Fermi or Bose. Stability of the second kind is 
however a fairly deep result, relativistically. In addition it will turn out that a 
bound on the fine structure constant a is also necessary to insure stability of the 
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second kind. Stability requires an understanding of the problem in the extreme 
relativistic limit where the mass of the electron no longer plays a role and can 
be set equal to zero. 

The reason that the extreme relativistic limit is relevant is the pair of 
inequalities 


lpl-l1<VJVp?+1-1<|Ipl, 


which are easily verified. Using the definition of the relativistic kinetic energy 
via the Fourier transform (cf. (3.1.11)) one obtains the inequalities 


Ely) —N < EM < EX), 
where 
~ N 
EN) = DOW, [pilW) + @Vy. 
i=l 


Thus the stability of the second kind of gel) and Ely) are equivalent. Now 
we note that by scaling yy b> yy, given by 


Wi(t1, 015...5%N,0n) = PNP WAX, O15...5;A"N, ON), 
and also Rt AR, 
EN) = AER (W). (3.2.7) 
Thus, 
EN(Z) = inl (EVI): € APR), Il = 
is either —oo or zero. By (3.2.7), if Ely) < 0 for some w then EOD) = —00, 
while EX'(Z) = 0 if and only if E€'(y) > 0 for all y. Thus, the stability of 
the first kind and of the second kind for EX) are equivalent, and both are 


equivalent to the statement that EM) > 0 for all & and all R. 
In Chapter 10 we consider the effect of external magnetic fields 


B(x) = curl A(x). 
For spinless particles this entails replacing p by p + /@A(x), ie. 
—A > (-iV + JaA(x))” (3.2.8) 


in both the non-relativistic and relativistic cases. With the replacement (3.2.8) 
the Hamiltonians (3.2.1) and (3.2.6) become Hy, y(A) or Hy y(A). It will 
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turn out that the replacement (3.2.8) does not significantly affect the stability 
of matter question. In the non-relativistic case, this is a consequence of the 
diamagnetic inequality discussed in Chapter 4. The same holds in the relativistic 
case but the situation is much more complicated and will be discussed in detail 
in Chapter 8. We can generalize the notion of stability of the second kind (3.2.5) 
by generalizing (3.2.4) to include B, i.e., 


Ey,m(Z) = inf{Ey(Z, R, B): Re R™}, 


where E'y(Z, R, B) is the ground state energy with field B. (By gauge invari- 
ance, the ground state energy depends on A only through B.) Stability of the 
second kind continues to hold for fermions, i.e., Ey, y(Z) > —S(Z)(N + M). 

The situation is drastically changed if we allow the fermions to interact with 
the magnetic field through their spin, i.e., 


Ja + 

Hy.m > Hy,u(A)~ ~P8 ae - B(xi), (3.2.9) 
where g is the gyromagnetic ratio of the electrons — usually taken to be 2 — and 
where o = (o!, 07, o*) denotes the vector of Pauli spin matrices (see Chapters 2 
and 10). Now, for any fixed R and Z, the last term in (3.2.9) allows us to drive 
the energy to —oo by letting B — oo in a suitable way. To correct this, and 
restore stability of the second kind, we add one more term to the energy — the 
magnetic field energy defined in (2.1.16) 


1 
Emag(B) = 5 / |B(x)|?dx. 
R3 


Its role is similar to that of the nuclear repulsion U. Its inclusion prevents B from 
becoming too large — provided Za? and a are both not too large, and0 < g < 2. 
This is similar to the requirements for stability in the relativistic case. 


3.2.2 Many-Body Hamiltonians: Models without Static Particles 


For all the results in the following chapters where stability of the second kind 
for fermions is proved, the nuclei can be considered to be static. Their kinetic 
energy is not needed for stability. 

As we shall see, a Coulomb system of bosons with static nuclei is not stable 
of the second kind. In the non-relativistic case the energy Ey y(Z) grows 
as —[min(N, M)]>/?. One might think that by taking the kinetic energy of the 
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nuclei into account, the instability can be ameliorated. This is in fact so if the 
nuclei are fermions, but if they are also bosons the system is still unstable, 
but not as badly. The energy only grows as —[min(N, M)]’/°. This will be 
discussed in Chapter 7. 

If both the positive and negative particles are dynamic, the wave functions are 
now functions of X and R and the corresponding spin variables. The energy of 
a non-relativistic system becomes 


M 
Enh) > Ev) + 9° T] 


j=l 


where Ey(w) is given in (3.2.2), and where T = (2p)! f IVawl is 
the kinetic energy of the positive particles with coordinates R,,..., Ry. 
Their mass is denoted by > 0. Of course yw is now in L2(RAN HA), 
For simplicity neglecting the spin variables in the notation here, we 
have Ww = W(x1,...,XNy;Ri,..., Ry) and it satisfies appropriate symme- 
try requirements separately for permutations of the coordinates x and R. 
(Cf. Section 3.1.3.) That is, it is antisymmetric in the electron coordinates x; 
and either symmetric or antisymmetric in the nuclear positions R; depending on 
the statistics of the nuclei. 

Another model without static particles considered in this book is a relativistic 
gravitating system. It consists of neutral particles of mass m,, e.g., neutrons, 
which interact only via the gravitational attraction. A more thorough discussion 
of the feasibility of such a model for neutron stars is given in Chapter 13. In any 
case the Hamiltonian of this model is then given by 


N 
1 
_A; a _ 2 
de € Aj +m; ms) Gm; 2 is aan 
i=l 1<i<j<N J 
where G is the gravitational constant, which is 
G = 6.674 x 1078 grams! cm? sec™”, (3.2.10) 


as determined by Cavendish in 1798 [28]. 
In dimensionless terms (in units where the neutron mass is one) this expression 
takes the form 


3 (V=arI J=ieei = 1) -« > — (3.2.11) 


i=l l<i<j<N |x; — Xj 
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where k = Gm? /hc is a dimensionless constant. If m, = mass of a neutron then 
«7x 10-77, 


The smallness of « is the reason why gravity can often be neglected, except in 
very large systems. 
The energy functional associated with (3.2.11) is given by 


a = ya. wv 2 ee 


ie oy x;| 


with ty defined in (3.1.11). It cannot be expected that this functional is bounded 
from below, in general. It has to be expected that beyond a critical N it becomes 
unbounded from below. As it will turn out, this critical number, N,., is the famous 
Chandrasekhar mass limit; it will depend on the statistics and it will be smaller 
for bosons than for fermions. The reader is referred to Chapter 13. 

One might consider a similar model for white dwarfs but then one has to take 
the electrostatic forces into account. The functional for a system of N electrons 
and M nuclei (usually helium nuclei) of mass m, in units of the electron mass 
and nuclear charge Z is given by 


M 


N ; ‘  y, : 
DUT + Ty" —@Z tem," | > aw 
i=1 | : 


j=l i=l j=l! 


1 
Zr = ee 
a a a iki — RI” 
l<i<j<M 
a 1 
+(aZ—Km,*)|¥, >> ——y]. (3.2.12) 
l<i<j<N Ix; — xj 


Here, e and n refer to electrons and nuclei respectively. The operator T° corre- 
sponds to J—Ax, +1 — | while T/” corresponds to ,/—A Rr m> — My. 

The conclusion about stability in Chapter 13 will be largely the same as for 
the neutron star model. There are still some open problems, however, which will 
be discussed in Chapter 13. 
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3.2.3 Monotonicity in the Nuclear Charges 


We are interested in finding the ground state energy for a given number N of 
electrons and a certain number M of nuclei with charges Z = (Z),..., Zy). 
These nuclei might either be fixed at locations R = (R,,..., Ry) or dynamic, 
in which case there are no fixed locations. 

Recall that Ey(Z, R) denotes the ground state energy in the fixed nuclear 
case, and Ew, y(Z) is the ground state energy in the case of dynamic nuclei. In 
what follows, the form of the kinetic energy of the electrons and nuclei is not 
important. What is important is the fact that the Hamiltonian depends linearly 
on the nuclear charges. (Hence we have to exclude the case where the dynamic 
nuclei are coupled to a magnetic field, since in this case the nuclear charge 
appears in the kinetic energy in a non-linear way.) 


Proposition 3.1 (Monotonicity in the Nuclear Charges). Assume that Z; < Z 
forallk =1,...,M. Then 


Ey(Z, R) => min Ey((Z, Z,..., Z), R). (3.2.13) 
RCR 


Moreover, if Z, < Z,. for allk =1,...,M, then 
Ey,m(Z) > Ey.m(Z). (3.2.14) 


In other words, in the case of fixed nuclear locations R = (R,,..., Ry), a 
lower bound is obtained by replacing each nuclear charge Z, by either 0 or 
the common upper bound Z, and taking the minimum over all such choices. 
In the dynamic case, even more is true, namely the energy is monotone non- 
increasing in each nuclear charge. The usefulness of this proposition lies in the 
fact that in order to prove stability of matter (of the first or second kind) it 
is enough to consider the case of equal nuclear charges. This observation was 
first made in [38, Lemma 2.3 et seq.]. We shall utilize this fact in the chapters 
to come. 


Proof. Each Z; appears linearly in the Hamiltonian, i.e., Hy,y is an affine 
(operator valued) function of Z; for fixed Z;, 7 # k. Its ground state energy is 
the infimum over (y, Hy, y) and hence is concave separately in Z;. (Note that 
this does not mean that it is a jointly concave function.) For fixed values of Z;, 
J #k, the minimum of Ey(Z, R) over Z; € [0, Z] is attained either at Z, = 0 
or Z; = Z. By applying this argument to all the nuclear charges separately, we 
end up with the statement (3.2.13). 
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In the dynamic case, the same concavity argument applies. However, for fixed 
Zj,J #k, the energy Ey,y(Z) can not be increasing for small Z;. If it were, the 
energy could be lowered by setting Z; = 0 which, however, has the same effect 
as moving the corresponding nucleus to infinity (which can be accomplished 
with infinitesimal kinetic energy cost for the nucleus). Hence Ey, is less than 
or equal to its value at Z; = 0, and hence must be monotone decreasing by 
concavity. This proves (3.2.14). | 


3.2.4 Unrestricted Minimizers are Bosonic 


For fermionic systems, the imposition of antisymmetry has a major impact on 
the stability question. For bosons, however, the imposition of symmetry often 
plays no role. One might as well minimize over all wave functions, irrespective 
of symmetry. That this is so (in the absence of magnetic fields) is the content 
of Corollary 3.1. When magnetic fields are present, the symmetry requirement 
is important and not automatic, although one does not expect it to affect the 
question of stability significantly. 

The main result, Corollary 3.1, is a consequence of the following abstract 
theorem. 


Theorem 3.3 (Symmetry of Minimizers). Let E(w) be a (not necessarily 
bounded) quadratic form on L?(R“") for some d (e.g., d = 3) with the properties 
that 


(a) E(w) = cy, W) for some constant c, 


(b) E(w) = E(w), where |W \(X) = |W], 
(c) E(w) = ECW) for all permutations x € Sy permuting the N particles, i.e., 


Wr(¥1,...,XN) = W(Xz(1); sey Xn(N)): 
Then 


Eo = inf{E(W) : (, W) = Y= inf{EQ) sw) = 1, Ue = W for all rr}. 


Proof. By assumption (a), Eo = inf{E(w) : (W, Ww) = 1} is finite. By assumption 
(b) and the fact that (Ww, vv) = (|wW|, |w|), we can assume that w = |y| without 
loss of generality when computing the infimum of €(y). The same is true 
when computing the infimum over symmetric yy, because |y| is symmetric 
if y% is symmetric. For given non-negative Ww, we write w = w; + w-, where 
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Ws = (1/N!) 0, Wa denotes the symmetric part of y and y, is the remainder. 
We note that w, is also non-negative. Using assumption (c) we have” 


E(w) = E(Ws) + Er) (3.2.15) 


and, similarly, 


(WW) = (Us, Ws) + Wr, Wr). (3.2.16) 
Note that since (Wz, Wz’) = 0, 


(Ws, Ws) = NV 


In particular, y, does not vanish identically. Since E(w,) > (W;, Ws)Eo and 
E(w,) = (W,, W,) Eo, we conclude that if w is a minimizer for € so is W;/||Wsl. 

If there is no minimizer, we can apply the same argument to a minimizing 
sequence. That is, if y” is a sequence of non-negative normalized functions 
with lim, E(w) = Ep, then also limy oo E(W)//(W, Ww) = Ep. This 
proves the theorem. | 


Corollary 3.1 (Unrestricted Minimizers for Relativistic and Non-Relativistic 
Systems are Positive and Bosonic). Consider an energy functional 


E(w) = Ty + Wy, 


corresponding to a Hamiltonian 


N 
H=)°T, + W(X) 


i=1 
with E(w) = (Ww, Hw). Here T; is either (2m)~'p?, wy € H'(R®%) (non rela- 
tivistic) or T; = /P? +m2—m,w € H'/?(R%) (relativistic). (Note that all 


> To see the equality in (3.2.15), consider first the sesquilinear form associated with €, given 
by E, W) = Ai E + iv) — AI) TES — i) + CH/DEG + W) — U/NEO +H). We 
can write E(Ws, Wr) = (NI YO, EW, W — Wo). Since E(Wa, W — Wo) = EW, Vat — 
Wz-1¢), this can be further written as E(Wy, VW) = =(N!)~? ae Doe E(w, Wr-l — Wa-1g) = 0. 
Hence E(yfs + Vr) = ECs + Vrs Ws + Wr) = Es, Vs) + Ee Wr) = Els) + EM). In 


other words, the cross-terms vanish. 
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the particles have the same mass.) The measurable function W is such that 


Ty + Wy = cy, w) 


1/2 


for some constant c and all in H' or H"'/*, Furthermore, W(X) is assumed 


to be symmetric, i.e., 
W(x1,...,¥n) = Waray, -.., Xxw)) 


for all permutations m € Sy. 

Then E(w) satisfies all the assumptions of Theorem 3.3. In particular the 
unrestricted infimum of E is the same as the infimum of E over non-negative 
totally symmetric functions wy. 


Proof. First notice that Ty > Tjy). In the non-relativistic case, this simply fol- 
lows from the fact that 


IV f(x) = IVIF@I? 


for almost every x, which holds for any function in H!(R®) [118, Thm. 7.8]. In 
the relativistic case, one uses the integral formula [118, Sects. 7.11 & 7.12] 


= 2 (d+1)/2 
A VipP em p= ff SOE (Fe) Kasnyatmix — ybaedy 


Réx Rd 


(3.2.17) 


(with K the modified Bessel function of the third kind, which is non-negative) 
and the fact that | f(x) — f(y)| = Il f(~)| — | fO)I|. A proof of this formula for 
m = 0 will be given in Lemma 8.1 in Chapter 8. 

Moreover, Wy = W)y);. That E(w) = E(w) is obvious. Hence the last state- 
ment follows from Theorem 3.3. | 


Remark 3.3. The proof above would not work in the case of magnetic fields, 
since the energy functional in this case does not decrease by taking the absolute 
value of the wave function. In fact, a minimizer will not be positive in this case, 
or even real. Moreover, the conclusion of the corollary itself does not hold, 
in general, if magnetic fields are present. Minimizers need not be bosonic. In 
fact, the bosonic symmetry requirement is essential for understanding vortex 
formation in dilute Bose gases subject to homogeneous magnetic fields [162, 
126]. 
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Remark 3.4. The corollary obviously generalizes to the case in which there 
are several species of particles and T and W are separately symmetric in the 
variables corresponding to each species. Then the ground state energy without 
symmetry restrictions coincides with the energy in which all the species are 
bosonic. 


CHAPTER 4 


Lieb-Thirring and Related Inequalities 


4.1 LT Inequalities: Formulation 


The non-relativistic versions of the inequalities discussed in this chapter were 
invented in 1975 [134, 135] in order to give an alternative, simpler proof of the 
stability of non-relativistic matter, first proved by Dyson and Lenard [44]. The 
basic idea, as we said in Section 1.1, was to relate the stability of quantum- 
mechanical Coulomb systems to the simpler Thomas—Fermi theory of Coulomb 
systems, which was known to have the required stability properties. 

These inequalities have many extensions and a sizable literature, and are 
collectively known as Lieb-Thirring (LT) inequalities. While we do not follow 
the Thomas—Fermi route to stability in this book, for reasons explained earlier, 
the inequalities will, nevertheless, play an essential role in various aspects of the 
relativistic and non-relativistic theories. 

In this introduction we shall explain the various inequalities. The next section 
gives their connection with kinetic energy inequalities, which was the original 
1975 motivation. The rest of the chapter is devoted to proofs and can be skipped 
by anyone interested only in applications. The proofs require a theorem about 
traces of operators, which is given in an appendix to this chapter. The proofs 
given here follow closely the discussion in [118, Chapter 12], except for the 
theorem in the appendix to this chapter, which is only stated but not proved 
in [118]. 

The LT inequalities concern the Schrodinger operator on L7(IR“), the space 
of square-integrable functions of d variables. For our purposes, d = 3 is the 
relevant case, but we may as well discuss all d > 1. The inequalities depend 
importantly on d. They are concerned with the sum of the y" power of the 
negative eigenvalues. For the application in later chapters, the sum of the negative 
eigenvalues (that is, y = 1) will be the most relevant, but we shall also have use 
for other values of y (in particular, y = 0 and y = 1/2). 
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Quite a few mathematicians have devoted a lot of time to explore generaliza- 
tions of the inequalities and bounds on the optimal constants. For instance, there 
are generalizations to Riemannian manifolds. Only the Euclidean case will be 
discussed here. 

The Schrodinger operator on L7(R%) is! 


H=-A+V(x) (4.1.1) 
and the eigenvalue equation is 


—Ayp(x) + Vix)w(x) = Ev(x) (4.1.2) 


with w € L*(R7). We assume that V is such that H is self-adjoint and bounded 
from below. For this purpose is suffices that V satisfies the requirements in 
(2.2.14), as explained in Section 2.2.1. The non-positive (i.e., negative or zero) 
eigenvalues, ifany, are labeled Ey < FE, < Ey <--- < 0, repeated according to 
multiplicity. There can be none, finitely many or infinitely many. They can be 
defined via the variational principle, see, e.g., [118, Sect. 11.2 et seq.]. 

The LT inequalities bound power sums of the negative eigenvalues by integrals 
of V_, the negative part of V. Recall that any function V can be written as 


V(x) = Vix) — V_(x) (4.1.3) 


with 
Vi(x) = max{V(x), O}, V_(x) = max{—V(x), O}. (4.1.4) 


With this convention, V_(x) > 0. We assume that V_ vanishes at infinity, i.e., 
the measure of the set where V_(x) > t is finite for all t > 0. 


4.1.1 The Semiclassical Approximation 


Before presenting the inequalities let us discuss their ‘semiclassical’ interpre- 
tation, which will make them more transparent and natural. According to the 
semiclassical approach, which goes back to the earliest days of quantum mechan- 
ics, each volume (277)“ in 2d-dimensional phase space (consisting of pairs of 
points (p, x) with p € R¢@ and x € R%) can support one quantum state. (If we 
restore Planck’s constant h then the volume is (27h)? = h“.) This prescription 


' Tn this chapter, there is no factor 1/2 in front of the Laplacian, as there is elsewhere in the book. 
This is the standard convention used by spectral theorists. 
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can be quantified by using it to ‘calculate’ the number of negative (actually, 
non-positive) eigenvalues by integration, as follows: 


> Ie @ ery i @(—p* — V(x))dpdx, (4.1.5) 
jz0 RéxR¢ 


where ©(s) = 1 if s > 0 and = Oif s < O. Our notation is that |E;|° = | even 
if E; is a zero eigenvalue. The integral in (4.1.5) can easily be evaluated by 
first doing the p integration over the ball in R¢ of radius ./V_(x). This integral 
is well known to be (see, e.g. [118, p.6]) [24/*/ P(d/2 + 1)] V_(x)4/?. From 
this we conclude that the semiclassical approximation to the number of negative 
eigenvalues is (4a) 2" /V(d/2+1)] ie V_(x)4/*dx. Note that this quantity 
does not depend on V,. 

We can go further and ‘calculate’ power sums of the negative eigenvalues, 
Se |E;|”, for all y => 0, by mine the function (277)~@| p* + V(x)|” over 
the subset of phase-space in which p* + V(x) < 0. This is an easy exercise, 
similar to the one just mentioned. One does the p integration first and finds the 
semiclassical approximation 


DIE; aes J V_(x)’*4dx (non-relativistic), (4.1.6) 
jz0 


where Di in (4.1.6) is the non-relativistic ‘classical’ constant 


Liy=anyt fap yap 
R¢:|p|s1 
_ ryv+) 
~ (4r)4/2T(y + 1 +. d/2) 


(non-relativistic). (4.1.7) 


In a parallel fashion one can derive a semiclassical estimate for the negative 
eigenvalues of the operator 


H = (—A)*’ + V@) 


for any s > 0. The only other case that will be of interest in this book is 
s = 1/2, which corresponds to the ‘ultra-relativistic’ Schrodinger operator 
discussed in Section 2.2.1. Similarly to the non-relativistic case, we integrate 
(2s)~¢| |p| + V(x)|” over the phase-space subset in which | p| + V(x) < 0. The 
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result is the semiclassical approximation 
> Fee oor / V_(x)’*4dx (relativistic), (4.1.8) 
j20 Rd 

where Bs q 18 now the relativistic ‘classical’ constant 


My + DPdd + 1/2) 
m4@+V/P2T(y + 14d) 


Ly = (Qny-* / (1 — |p|)"dp = (relativistic). 


a\ele 
er (4.1.9) 
There are many theorems, which we shall not explore here, that state (under 
some smoothness conditions on V) that the formulas (4.1.6)—-(4.1.9) are asymp- 
totically exact for the potential AV as A — +00. The question we address here is 
whether (4.1.6) and (4.1.8) can be turned into inequalities for arbitrary V, with 
only the property that V_ is in the appropriate L’?(IR“) space and without 2 > 
++oo. Naturally, we might expect that such inequalities will only hold with the 
constants Lo 4 on the right side replaced by some other constants L,, 4. The fact 
that Ly q 1s valid asymptotically implies that L,, 7 can never be smaller than Le a 
An extension that will be important for us is the inclusion of a magnetic field. 
That is, the replacement? 


ha (=f AG), (4.1.10) 


where A is some vector field (with the physical interpretation that curl A(x) 
equals the magnetic field B(x)). From the semiclassical perspective, the intro- 
duction of A makes no difference; we replace Pp by (p+ A(x))* but then we 
can make the change of variables p —> p — A(x) in the integration leading to 
(4.1.6) and end up with the same result as before, namely Le Pai os +4/? | This 
independence of A(x) also appears in classical statistical mechanics and leads to 
the observation of Van Leeuwen [182], in the early days of quantum mechanics, 
that there is no orbital diamagnetism, classically, and, therefore, orbital diamag- 
netism must be a quantum-mechanical phenomenon. It might be supposed that in 
the inequalities we seek an allowance might be necessary for a small increase of 
L,,a if one wants to have a value of L,,.q that is independent of A. Nevertheless, 
itis a fact that all presently known values of L,, z, both sharp ones and non-sharp 
ones, hold for arbitrary A fields. The unknown sharp constants might have to 


2? For notational convenience, we shall absorb the fine-structure constant a into A in this chapter, 
and write p + A instead of p+ /aA. 
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be modified to allow for an A field, but this will have to be decided by future 
developments. The reason for this is that all known proofs use the resolvent 
of —A in one form or another, and it is well known that this resolvent satisfies 
a diamagnetic inequality. We shall discuss this further in Section 4.4. 

It turns out that the desired inequalities can be achieved if and only if y and 
d satisfy certain conditions (stated in the theorems below). These conditions 
are optimal, meaning that otherwise no inequality of the desired form can hold 
with any fixed, finite value of L,, a. The necessity of y > 0 ford = landd = 2 
in the non-relativistic case, for example, comes from the well known fact that 
when d < 2 any arbitrarily small, negative V always has at least one negative 
eigenvalue. For d = 1, this negative eigenvalue is of the order 47 if V is replaced 
by AV for some small parameter A, and hence an inequality of the desired form 
can only hold if 2y > y + 1/2, or y > 1/2. 


4.1.2 The LT Inequalities; Non-Relativistic Case 


We shall first discuss the LT inequalities in the non-relativistic case. 


Theorem 4.1 (Non-Relativistic LT Inequality). Fix y > 0 and assume that 
the negative part of the potential V_ satisfies the condition V_ € LY*4/?(R®). 
Assume that A € Le (eR)? Let Ey < E, < Ey <--- be the non-positive 
eigenvalues, if any, of (—iV + A(x))” + V(x) inR¢. Then, for suitable d, defined 
below, there is a finite constant Ly qa, which is independent of V and A, such 


that 


> IE < Lye [are (4.1.11) 
j20 Ra 


This holds in the following cases: 
1 
>. d= 1, 
Ve ser 


y>0 ford =2, (4.1.12) 
y>0 ford>3. 


3 The space L? 


2, (IR) consists of functions that are square integrable in any ball in R@, but not 
necessarily in the whole of R*. The notation L?(R“;IR“) means real-vector-valued functions 


such that each component is in L?(R®). 
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Otherwise, for any finite choice of Ly,q there is a V that violates (4.1.11). We 
can take 

(d+y)Py/2y 

Lya = (ny 42" y } 20(v+1+d/2) 

Jay? —1/4) ifd=1,y > 1/2. 


ifd>1,y >0ord=1,y>1, 


(4.1.13) 


Remark 4.1. The fact that Ey > —oo under the assumption on V stated in The- 
orem 4.1 already follows from the discussion in Section 2.2.1, Eq. (2.2.14). The 
necessity of the conditions (4.1.12) was explained at the end of Subsection 4.1.1. 


The bounds (4.1.13) presented in this theorem are certainly not the best 
available, and they can be improved. We list them since they can be obtained 
relatively easily with the methods first used in 1975 [135], as shall be explained 
below. These methods work only for y > 1/2 in case d = 1, and y > 0 in case 
d > 2, however. The extension to y = 1/2 inthe case d = 1 was done by Weidl 
[185], and the sharp constant was obtained by Hundertmark, Lieb and Thomas 
[94]. The y = Ocase for d > 3 was independently proved by Cwikel [35], Lieb 
[112] and Rosenblum [152] and is now known as the CLR bound. Their proofs 
are very different from each other and will not be presented here. Other proofs 
were given later by Li and Yau [107], Fefferman [56] and by Conlon [33]. A 
bound that is closely related to the CLR bound is in Corollary 4.2 on page 80. 

The sharp (= optimal or best) value of L, 4 is known in some cases (but not 
in the physically most interesting case, y = 1, where an upper bound on L,, 4 
will have to suffice for the present). In some cases Lyg = LY 4 and in others 
Lyd > iat It is conjectured [135] that L,;3 = EA 5. The best we have at present 
(due to Dolbeault, Laptev and Loss [41]) is 


ies aR L', © 1.814L%, © 0.0123. (4.1.14) 


In fact, it was shown in [41] that 


L for all y > 1 and for all d > 1. (4.1.15) 


< Uv L° 
yd = a yd 


We emphasize that the LT inequalities with these constants hold for arbitrary 
magnetic vector potentials A. Moreover, [102, 2, 15] 


Ly@= ae for all y > 3/2 and for alld > 1. (4.1.16) 
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It is also known that Ly ¢ > Lt. if y < 1 [88]. For some review articles see 
[93, 102]. 
For y = 0, the currently best known constant for d = 3, as obtained in [112], 
1S 
1 
Lo3 < 6.87 Lo, = 6.87 — = 0.116. ALAT 
03 0,3 6a2 ( ) 


As we shall now explain, this implies that 


Ly3 <687L", (4.1.18) 
for any y > 0. 
We note that, in general, the ratio L,, 4/ is q 1S decreasing in y [2], i.e., 
nd Se (4.1.19) 
Lyd Ly 48.4 


for any 5 > 0. To see this, one simply notes that fore < 0 and 6 > 0 
lel’? = cy, : dare le + ay’ 
0 


for some constant c,,5. Hence 


fdnae- SA wiageale ele 
iy EW i DV j>0, £40 1Ej + Al 


24V(x)\"dxdp © 
aba Y “ Po fanaa! ff |p? 4 V(x) Falrdxdp 
7 0 p?+V(x)+Aa<0 


To obtain an upper bound on the right side, one can certainly take the supremum 
over A of the ratios of the integrands, which is less then L,, 4/ LY. q by definition. 
On the other hand, Ly+5,4/ Le +3.q 18 the supremum of the left side over all V, 


cl cl 
and hence Ly+3,4/Lo 454 < Lya/Ly a: 


4.1.3 The LT Inequalities; Relativistic Case 
We shall now discuss the relativistic case. Here, the Schrodinger operator is 
H =|-iV + A(x)| + V(x), (4.1.20) 


with corresponding eigenvalues. 
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Theorem 4.2 (Relativistic LT Inequality). Fix y > 0 and assume that the neg- 
ative part of the potential V_ satisfies the condition V_ € LY*4(R¢). Assume 
that A € 12 (R?; R*). Let Eo < E, < Ex <--- be the non-positive eigenval- 
ues, if any, of | — iV + A(x)|+ V in R¢. Then, for y > 0 in case d = 1, and 
y = Oincase d = 2, there is a finite constant La, which is independent of V 
and A, such that 


SIE? S Ly [roe (4.1.21) 


j20 Rd 


We can take 

d+l 
rgyr) . 
Trd+d+y) 


Using a different method from the one presented in the proof below, following 
the method of [112], Daubechies [37] has shown that 


yg” (a + 5) 4/212 (4.1.22) 


Lo,.3 < 6.08 Dye 


in the relativistic case. In combination with the argument above this actually 
implies that L,3 < 6.08 Lis for all y > 0. In particular, 


6.08 
11,3 < 6.08 Li, = ml 


= = 0.0257. (4.1.23) 
IT 


In contrast, our bound (4.1.22) for y = 1 and d = 3 states that L1 3 < 7z Ls 
There is an LT inequality not only for — A and ,/—A but for all powers (— A)° 
with s > 0. Since we need only s = 1 and s = 1/2 we shall not discuss the other 
cases here. 
It will be noted that for relativistic mechanics it is necessary to consider 
the operator T,,(p) = / p? + ? — uw for > 0. To handle this we can use the 
simple facts that 


JK =p. <= (AF = eb = AA, (4.1.24) 


Since we are only interested in showing that the total energy is bounded above 
and below by a constant times the particle number N, the presence of a term 
LN is irrelevant. Nevertheless, one can ask if Theorem 4.2 can be improved to 
take explicit account of 7,,(p) = / p* + * — jw. The answer is yes if the right 
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side is replaced by the appropriate semiclassical expression 


(2x)~4 /| \7,.(p) + V(x)|” dxdp. (4.1.25) 


T,(p)+V(x)<0 


Note that for jz > 0 this expression is finite if and only if V € L’+4(R4)N 
LY*4/2(R¢), Moreover, the corresponding Lieb-Thirring estimates hold only in 
the non-relativistic range, i.e., fory > 1/2atd=1,y > Oatd =2, andy >0 
at d > 3. The reason for this is that T,,(p) © p*/(2) for small | p|, and hence the 
low energy eigenvalues will be approximately equal to the non-relativistic ones. 
Only at large values of | p| do relativistic effects become important. The effects of 
positive mass jz were analyzed by Daubechies in [37]. For d = 3, the best known 
bounds for the number of eigenvalues of T,,(p) + V is 10.33 times the semiclas- 
sical expression ((4.1.25) with y = 0) [70] and for the sum of negative eigen- 
values it is 9.62 times the semiclassical expression ((4.1.25) with y = 1) [37]. 


4.2 Kinetic Energy Inequalities 


Let us focus on the case y = 1 of the previous Section 4.1. The common setting 
is that we have an operator of the form H = T + V and we know that the sum 
of the negative eigenvalues of H is bounded below, as follows. 


Els Lf voyrex. (4.2.1) 


jz0 R¢ 


In the cases of interest T is —A (p = 1+d/2) or /—A (p = 1+), but it 
could be anything as far as the present section is concerned. Our goal is to use 
(4.2.1) to bound T instead of the E;. This was the bound used in [134, 135] to 
relate the Schrodinger energy to the Thomas—Fermi energy. 

Let W(X1, 01, X2, 02,...,X Ny, Oy) be any N-particle wave function of space- 
spin (x € R¢, o € {1,2,...,q}). It is not necessary for our purpose here to 
assume any symmetry properties. We are interested in the ‘kinetic energy’ 


N 
Ty := (« > E ‘) (4.2.2) 


i=1 
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We assume that w is normalized, i.e., (Ww, w) = 1, and our goal is to bound Ty, 
from below in terms of the one-body (spatial) density at x € R@ given by 


N 
: 2 
Oy(x) = So / |W(X1, O1,...,Xi-1, O71, X, Oj, ..., XN, On) | 
i=l 


2 RW-1ad 


x dx, ---dx;--- dry. (4.2.3) 


Here, we use the same notation as in Chapter 3, Eq. (3.1.5), with dx; meaning 
that integration is over all variables but x;. Note that is arbitrary and Ty, has 
nothing to do with any V; the latter is introduced only as an aid in proving the 
following bound. 

For the following theorem it is necessary to introduce the largest eigenvalue 
of the spin-summed one-particle density matrix of w. Recall from Chapter 3, 
Section 3.1.5, that the one-particle density matrix vy of y is given as 


N 
1 
y= Ye f Hei etiz zw) 
i=1 


x W(Z1, 220) Si-1s Zz. we, ZN)AZ1 bi dz; . --dzy. (4.2.4) 


The spin-summed one-particle density matrix a is then 


q 
Dy 'e,x) = Diy, 0, x',0). (4.2.5) 


o=l1 


Its largest eigenvalue will be denoted as || ||. The one-particle density in 
(4.2.3) is just oy(x) = yy (x, x). 


Theorem 4.3 (Fundamental Kinetic Energy Inequality). With the kinetic 
energy and the density as defined above, and assuming the inequality (4.2.1), 
the kinetic energy of a normalized fy is bounded as 


K z 
Ty = (p/p Oy (xX) dx (4.2.6) 
ee Wee? 
Rd 
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where p’ = p/(p — 1) and 


(pL) (p'K)? = 1. (4.2.7) 


Moreover, inequality (4.2.1) and inequality (4.2.6) (with (4.2.7)) are equivalent. 
The truth of (4.2.6) (with (4.2.7)) for all implies the truth of (4.2.1) forall V. 


Remark 4.2. For antisymmetric functions w, we have seen in Chapter 3, Sec- 
tion 3.1.5, that ||~"||,o <q, and this fact will be crucial for our application in 
the proof of stability of matter! We shall use (4.2.6) with q?/? in the denom- 
inator. In general, IPP loo < N for any w& because Tr[y] = Tr[y@] = N. 
For bosonic wave functions, this inequality can be saturated for simple product 
wave functions, (x1,...,Xy) = Pes P(x;). 


Remark 4.3. A slightly more general version of Theorem 4.3 also holds, in 
which one considers more general density matrices IX and not just rank one 
density matrices as here; that is, one can replace W(z)w(z') by Pe, ). (CL 
Section 3.1.4 in Chapter 3 for a discussion of density matrices.) We leave this 
simple generalization to the reader. 


Remark 4.4. If we take g = | and take y to be any determinantal function 
W(X1,%2,...,¥y) = (NIP det{ (x j= (4.2.8) 
where {bi} , is any collection of N orthonormal functions in L?(R?), then we 
find that 
N N 
ov(x)= So idi(x)? and = Ty = 9 (i, TH). (4.2.9) 
i=l i=l 
In this case, || 7), lloo = 1. 


Let us be very explicit and give the known value of p, p’ for d =3 in 
the non-relativistic and relativistic case, as well as best known value of K. 
The following corollary follows from Theorems 4.1, 4.2 and 4.3, together 
with the bounds (4.1.14) and (4.1.23) on the optimal constants in the LT 
inequalities. 
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Corollary 4.1 (Kinetic Energy Inequalities). For d= 3 and T =(-iV + 
A(x))’, there is a K independent of N such that 


K 5/3 
Ty 2 3 (1), 2/3 Oy(x)y'dx (4.2.10) 
IYyIloo ES 


for any v € H4(R3%). In fact, K > a K* and K"' = 3(6n2)2/3, 
Ford =3 and T =|—iV + A(x)|, we have 


Dice 5 sain | evcoas (4.2.11) 


IR3 


for any W € H4!7(R3%), with K > (6.08)-"3K" and K*! = 3 (6x2)'”. 


Proof of Theorem 4.3. Given w, we consider a one-body operator H = T+ V, 
as above. We note that both T and V act only on the spatial part of a function 
in L*(R4;C%) and not on the spin variables. We then consider the N-body 
operator 


N 
Ky = > Hi;, 
= 


acting on N-particle wave functions of space-spin. With the aid of the one- 
particle density matrix ae we can write the expectation value of Ky as (see 
Eq. (3.1.34) in Chapter 3) 


(W, Ky) =TrlA yy] = TlH py’. 


Here, we abuse the notation slightly, since the symbol Tr for the trace stands 
for the trace over L7(R7; C2) = L?(R“) ® C4 in the second term, but for the 
trace only over L?(R“) in the third term. The fact that H is independent of spin 
implies that the C% trace affects only Vy ) 

The minimum of Tr[Hy] over all positive trace-class operators’ y with 
IV lloo < IP lloo is clearly given by the sum of the negative eigenvalues of H 


* Operators A such that Tr|A| = Tr VATA < 0. 
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times yy loo. That is, the optimal choice for y is || a lloo times the projection 


onto the negative spectral subspace? of H. Hence 


(W. Kn) = ly leo D5 Ej. (4.2.12) 


j20 


A consequence of (4.2.12), together with (4.2.1), is that 


(W, Ky W = Ty + a VeNoeCoue 


Rd 
> Pp" loo D> Ej 
j20 
> “199 loo Ef V_(x)Pdx. (4.2.13) 
Rd 


This holds for any normalized y. Now we make a special choice for V, which 
will depend on the y being considered. We choose 


V(x) = —Coy(x)/P-9, (4.2.14) 


where C > 0 is some constant to be determined appropriately. From (4.2.13) 
and (4.2.14) we have that 


Te y Oy (x)? dx — [ly llooLC? ; Oy (x)? dx. (4.2.15) 
R¢ Ré 


Obviously, we choose C to make the right side of (4.2.15) as large as possible, 
C=O 07 aly”, (4.2.16) 


from which (4.2.6) and (4.2.7) follow. 

Finally, to prove that (4.2.6) (with (4.2.7)) for all & implies the truth of (4.2.1) 
take q = | and take yf to be the determinant formed from all the eigenfunctions 
corresponding to the negative eigenvalues of H = T + V (if there are infinitely 
many negative eigenvalues just choose any N of them and let N — oo attheend). 
For this y, Pe lhe = 1. We then have (using (4.2.6) and K = (pL)~?'/?/p’ 


> For our H, this subspace consists of linear combinations of eigenfunctions with negative or zero 
eigenvalues. 
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from (4.2.7)) 


ah + [ veroy(aax 


J 20 Rd 


>K ; Oy(x)? dx — / [(pL)'/? V_(x)II(pL) oy (x) dx 


R¢ Rd 


>K / Oy(x)? dx — = / (pL)? oy (x)? dx — ~ : [(pL)'/? V_(x)]?dx 
Rd Rd Rd 


a -1f V_(x)? dx. 


Rd 


The last inequality is Hélder’s inequality [ fg < (f f?)!/?(f g?)'/' followed 
by ab < a?/p+b? /p’. a 


4.3 The Birman-Schwinger Principle and LT Inequalities 


Our goal here is to prove (4.1.11) and (4.1.21) and, more generally, the analogous 
inequality for H = (—A)* + V(x). We will only consider the case A = 0, and 
explain the generalization to arbitrary A in Section 4.4. 

The first step is to note that we may as well replace V = V, — V_ by the non- 
positive potential — V_. The reason is that if we compare two operators H and 
H= (—A)* — V_(x) we see that H > H (in the sense that (¢, Hd) > (¢, Ho) 
for all functions #). This fact implies, by the variational min-max principle [118, 
Sect 12.1], that the corresponding eigenvalues are related by E; > E j for all 
j. Therefore, since we are interested in bounding the eigenvalues from below 
by —f/ V”, which is independent of V,, we may as well omit the V, term in V. 
Henceforth, H = (—A)* — V_(x). 


4.3.1 The Birman—Schwinger Formulation of the 
Schrodinger Equation 


The next step is to rewrite the Schrodinger equation for a negative eigen- 
value —e as ((—A)’ +e) W(x) = V_(x)w(x). If we now define (x) = 
/ V_(x) W(x) we obtain ((—A)* + e) W(x) = ./ V_(x) d(x), which implies that 


76 Lieb-Thirring and Related Inequalities 


w =((—A) + e) | /V_(&) #(x) or, equivalently, 
= Keo (4.3.1) 


where K, (called the Birman—Schwinger kernel [18, 157]) is the integral kernel 
given by 


K(x, y) = V V_-(x) Ge(x — y)V/ V_-(y), (4.3.2) 


where G,(x — y) is the integral kernel (or Green’s function) for the inverse 
of the positive operator (—A)* + e. Explicitly, (K.¢)(x) = te K(x, y)o(y)dy 
and similarly for Ge. 

The kernel G,(x — y) is well known and is given by the inverse Fourier 
transform 


1 
G(x —y)= / Dake pe? (27ik - (x — y))dk. (4.3.3) 
Rd 


While we will not need an explicit formula for G, let us record the function 
G_(x) for the usual Laplacian (s = 1) and dimensions 1 and 3: 


1 
COs 7, \-velF)): d=1,s=1 


G.(x) = exp(—Je |x|), d=3, 5=1. (4.3.4) 


Arr |x| 

Equation (4.3.1) says that when —e is a negative eigenvalue of H then | is an 
eigenvalue of K,. Although it is not a priori clear that ¢ = ./V_W is in L7(R) 
we claim that it is; moreover, we claim that there is a one-to-one correspondence 
between an eigenvalue —e of H and an eigenvalue | of K,. This is important for 
us because it enables us to reformulate the problem in terms of traces of powers 
of K. on L?(R¢). 

From now on we will assume that s = 1, and comment on the general- 
ization to s <1 at the end of the proof. If wy ¢ H'(R?) then 6 = /V_w 
is in L*(R“) by our assumptions on V, as explained in Section 2.2.1. If y 
satisfies —Aw — V_w = —ew then clearly ¢ #0 since otherwise —Ay = 
—ey which is impossible for an H! function. Moreover, K. maps L7(R?) 
into L?(IR¢) by the Hardy-Littlewood-Sobolev inequality [118, Thm. 4.3]. 
So @ is an L*(R®) eigenfunction of an L*(R‘) > L?(R¢) operator. Con- 
versely, if @ € L7(R“) satisfies K.¢=¢ then, if we define wy =(—A+ 
e) |./V_@, we have that (-A+e)W = ./V_¢ = /V_K.¢ = V_w. The 
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fact that w € L?(R®%) follows from (Ww, w) = (/V_¢, (—A + e)-*./V_¢@) < 
e'(./V_¢, (-A +e)! /V_¢) = e7'(¢, ¢). Moreover, yw 4 0 since other- 
wise ./V_d would be zero and hence K.¢ = 0. This one-to-one correspon- 
dence between w and ¢ implies that the multiplicities of the eigenvalue —e of 
—A — V_ and the eigenvalue 1 of K, are the same. 

There is another important observation about (4.3.1) and (4.3.2). For any given 
e > Othe operator K, has a spectrum of eigenvalues, which are all non-negative. 
(K, is compact and, moreover, Tr (K,.)” < 00, for suitable m > 0, as we shall 
see.) The observation is that K,, as an operator, is monotone decreasing in e. That 
is, if e < e’ then K, — K. > 0, as an operator inequality. This monotonicity is 
true for any operator of the form (A + e)~! with A > O since the difference is just 
(e' — e)(A+e) (A + e’)"!, which is clearly positive. Hence all the eigenvalues 
of K, are monotone decreasing in e. They are also easily seen to be continuous.° 

These facts stated above are shown schematically in Figure 4.1 in which the 
eigenvalues A; of K, are plotted as functions of e. This figure immediately leads 
to the following conclusion: For each number e > 0 we can define 


N. := number of eigenvalues of H less than or equal to —e, 


including multiplicity, as usual. We can also define B, to be the number of 
eigenvalues of K, that are > 1. Then 


N. = B.. (4.3.5) 


4.3.2 Derivation of the LT Inequalities 


Let y > 0. To exploit (4.3.5) we write 
Ce 


IE => = y [ en, de (4.3.6) 


j20 j20 0 


which is easily verified by integration by parts while noting that the derivative 
of N- is just a sum of unit delta-functions at the numbers e;. 


® The simplest way to see continuity is to note thatO < K, — K. < [(e’ — e)/e']K, for0 <e < e’. 
By the min-max principle [118, Thm. 12.1], the eigenvalues of K, differ from the corresponding 
eigenvalues of K, by at most (e’ — e)/e’ times the norm (i.e., largest eigenvalue) of K.. 
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Figure 4.1: The eigenvalues A) > A; > --- of K, are schematically shown as a function 
of e. Only the first five are shown here. The eigenvalues of the Schrodinger equation are 
E; = —e; and the e; are the values of e for which A; = 1. 


On the other hand, B, is certainly < Dos 1 rN < Tr (K.)” for any number 
m > 0. By Theorem 4.5 in the appendix to this chapter and (4.3.3), 


Ne = B. < Tr (K,)” 
< Tr(V_)"/*(G.y"(V_)"/? 


= ; V_(x)" G20) dx 


Rd 


1 m 
_ / Gonbr ron / V_(x)" dx (4.3.7) 
Rd R¢ 


as long as m > 1. The expression (/' .. .) on the right side is just G”"(0), as we 
see from the analogue of (4.3.3) for the kernel G”’(x — y). This expression is 
finite if and only if 2m > d, in which case it is, by scaling, Cg,,e7"*4/? with 


dk = (4n) 22 4) 


1 
ar | (Oakey Fun) 
Rd 


(4.3.8) 
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Unfortunately, when e~”*4/? is inserted in (4.3.6) we obtain a divergent 
integral either at e = 0 or at e = ov, for any m. To remedy the situation 
let us consider the potential W.(x) = [V(x) + e/2]_ = max{—V(x) — e/2, 0} 
(which depends on the value of e under consideration in (4.3.6)). Clearly, W, is in 
LY*4/2(R¢) if V_ is. Moreover, with N,(V) denoting the number of eigenvalues 


of —A + V which are < —e, it is easy to see that 

Ne(—V_) = Nej2(—V_ + €/2) < Ne/2(—We) (4.3.9) 
because W, > V_ — e/2. Let us, therefore, replace e by e/2 and V_ by W, in 
the right side of (4.3.7) and then insert the result in (4.3.6). We obtain 

2V_(x) 
a [Ej\” <y Cam / / ev l-m+d/2. gm—d/2(y (x) — ¢/2Y"de | dx 
jz0 Rd 0 
Tm — d/2)l'(-m+ y+ d/2) 
(iy +1+d/2) 


= (47)~4/?2" ym V_(x)’*4/? dx. 


(4.3.10) 


In order for (4.3.10) to be finite we have to require m to satisfy d/2 <m < 
y + d/2, so we choose m = (y + d)/2. For d = | this choice is only greater or 
equal to 1 if y > 1. Hence, if d = 1 and 1/2 < y < 1, we choose m = 1. This 
leads to the value given in Theorem 4.1 and concludes the proof of that theorem 
(except for the cased = 1, y = 1/2 andd > 3, y = 0). 

The proof for s = 1/2, or any other s for that matter, proceeds in exactly 
the same way and we leave the details for the reader. To obtain the con- 
stants in (4.1.22), one chooses m = d + y/2 in the relativistic case. This proves 
Theorem 4.2 (except for the cased > 2, y = 0). 


Remark 4.5. For y = 1 andd = 3, ourchoice of mism = (y + d)/2 = 2 inthe 
non-relativistic case. In this special case it is very easy to prove inequality (4.3.7) 
without having to employ the general Theorem 4.5. In fact, using a simple 
Schwarz inequality and Plancherel’s identity (2.1.40) we have 


Tr(K.)Y = / / V_(x)Vo(y)Ge(x — y)’dxdy < i V_(x)*dx / Ge(y)dy 


R3xR3 R3 R3 


7% Oe Gee 
, (2rk|? +e)” 
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This case is of particular relevance in the proof of stability of non-relativistic 
matter discussed in Chapter 7. 


4.3.3 Useful Corollaries 


The special case y = 0 in Theorem 4.1, the CLR bound, is not covered in the 
foregoing proof. It is abound on No, the total number of non-positive eigenvalues 
of —A + V(x), and is valid only for d > 3. 

There is, however, a bound on N,, the number of eigenvalues below —e, 
which is valid in all dimensions when e > 0, and which will be useful later in 
this book. Its proof is actually contained in the proof of Theorem 4.1 in the 
previous subsection, and so we take linguistic liberties and call it a corollary. 


Corollary 4.2 (Bound on N,). Let d>1, y > 0 and e > 0. Then N,, the 

number of eigenvalues of —A + V(x) less than or equal to —e, is bounded by’ 
pe ry) 

a ey (4rr)4/2 P(d/2 + y) 


[resent dx. (4.3.11) 


Similarly, in the relativistic case, the number of eigenvalues of /—A + V(x) 
below —e is bounded by 
Y d+1 
N.< 2 f Pero 
~ ev e4tD/2 d+ y) 


i [V(x) + e/2]'" dx. (4.3.12) 


Corollary 4.2 is proved by combining (4.3.5), (4.3.7) and (4.3.9). 

One more inequality that is easily derived from the proof of Theorem 4.1 
(which even extends down to e = 0) is due to Birman [18] and Schwinger [157]. 
It is valid for d = 3 only. 


Corollary 4.3 a Bound). For e > O0andd = 3 


[V(x) + e]_[V(y) + e]- 

Nex Sap I] c= He dxdy (4.3.13) 
4/3 

ae | Ve) +eldr | (4.3.14) 


7 We emphasize that e in (4.3.11) is the energy and not 2.718. 
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Proof. We first consider the case e > 0. For 0 < € < e, we use 
NAV) < N.(—[V(x) + e — €]_). 


According to Eq. (4.3.5), this latter quantity equals the number of eigenvalues 
of 


1 
/tV — ¢]_——__—_ 
[eae ele, 7 
that are greater or equal to 1. For d = 3, the integral kernel of this operator is 


given by (compare with (4.3.4)) 


aN pe el- 


(= velx — yl) 


exp 
VIVix) +e -e]- VIVO) +e -€]- =a 


The number of eigenvalues greater or equal to 1 is certainly less than the sum 
of the square of all eigenvalues. The latter is simply the square of the Hilbert— 
Schmidt norm, which equals 


1 | [V(x) +e -—e]_[V(y)+e-e]_ 


(4a) Ix — y|? 
R?xR3 


exp (—2,/e|x — y|) dxdy. 


The exponential factor can simply be bounded by one. The remaining integral 
is monotone in ¢ and converges to the right side of (4.3.13) as e > 0. 

The reason we cannot apply this argument directly for e¢ = 0 is the fact 
that (4.3.5) does not hold, in general, for e = 0. To circumvent this prob- 
lem and prove (4.3.13) for e = 0, consider a perturbation of V of the form 
V(x) > V(x) —AW(x), where W(x) is a smooth and rapidly decaying func- 
tion that is strictly positive. By the variational principle, the number of strictly 
negative eigenvalues of the operator —A + V — AW is greater or equal to No(V). 
Hence 


No(V) < lim lim N.(V — AW). 


Applying the bound for N.(V — AW) above and using monotone convergence, 
we arrive at the desired result. 

Finally, the inequality (4.3.14) follows from (4.3.13) by an application of the 
Hardy—Littlewood—Sobolev inequality, see [118, Thm. 4.3]. | 


Compared with the CLR bound (Eq. (4.1.11) for y = 0) the inequality in 
(4.3.14) contains the 4/3 power of the dimensionless quantity [ Vv’ instead of 
the first power. This might look like an innocent detail, but it can make a huge 
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difference. In fact, if V is the sum of N widely separated pieces, the number 
of bound states will roughly be the sum of the number of bound states for each 
piece, and so will be proportional to N — not N*/?! The important point about the 
LT inequalities is that there is no exponent outside the integral, for any y > 0, 
and this confirms the intuition obtained from the semiclassical expressions. This 
fact is crucial for the understanding of stability of matter, as will be discussed 
in the chapters to follow. 


4.4 Diamagnetic Inequalities 


The proof of the Lieb-Thirring inequalities presented in the previous section 
carries over to the magnetic case essentially without change. In particular, 
Theorems 4.1 and 4.2 also hold, with the same constants, when — A is replaced 
by (-iV + A(x))* for some vector potential A(x). In fact, all the best presently 
known constants in these inequalities are independent of A(x), since all the 
proofs use the diamagnetic inequality in one way or the other. We shall explain 
this inequality in the following. 

Note, however, that the situation changes dramatically when spin is introduced 
(cf. Section 2.1.6 and Chapter 9), since there can be cancellations between the 
(—iV + A(x))? term and the extra term o - B. LT-type inequalities in this case 
were given in [123, 47, 133, 25, 26, 169, 49, 50], but we shall not discuss them 
here. 

One formulation of the diamagnetic inequality is the fact that the integral 
kernel of the resolvent of (—iV + A(x))’, i.e., 


1 
(-iV + A(x))? +e 


(x, y) 


for e > 0 is point-wise bounded above by the corresponding quantity for A = 0, 
that is 


(x, y)=G,.(x —y) forallx, y eR’. 
(4.4.1) 


1 
(iv +Aqy4te = —-A+e 


Obviously, (4.4.1) is enough to establish the LT inequalities. Its proof is due to 
Simon [165]. We first consider the following Lemma. 
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Lemma 4.1 (Kato’s Inequality). Let sgn(u) = i/|u| if u(x) 4 0, and 0 oth- 
erwise. Assume’ that A € C!(R4), ue Li (R*) such that (-iV + Au € 
L?.(R4). Then? 


—Alu| < Rsgn(u)(—iV + A(x))?u. (4.4.2) 


For simplicity, we shall only prove this Lemma if u € C? and |u| € C*. We 
refer to [150, Chapter X.4] for the extension. 


Proof. Since 2\u|V|u| = V|u|* = 28% aVu, we first note that 
ju|V|u| = RaVu = Ra(V +iA)u. 
Taking the divergence of this equation, we obtain 
|V ull? + ful Aju| = |(V +iA)ul? + Ra(V +iA)u. 
By writing u as its absolute value times a phase, it is easy to see that 
CV + iA)ul? > |V |u|? (4.4.3) 
This implies the statement of the lemma. | 
An immediate corollary of the inequality (4.4.3) is that 
(Ww, (-iV + A@)YPW) = (Wl, -Alw) (4.4.4) 
for any yw € H'(R®). The analogue in the relativistic case is 
(W,|-iV + A(e)ly) = vl, VAD, 


which can be obtained from Eq. (4.4.8) in the following theorem by a suitable 
t — 0 limit. These inequalities show that the ground state energy of single 
particle in a potential V is lowest when the magnetic field is absent. 

With the aid of Kato’s inequality, we can now prove 


Theorem 4.4 (Diamagnetic Inequality). Let A € L?,.(R“; R). Then, fore > 0 


loc 
and for almost every x, y € R4, 


1 
(iv +A@y +e") = age 


(x, y). (4.4.5) 


8 C* consists of functions with continuous derivatives up to order k. 
° Sz denotes the real part of a complex number z. 
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Moreover, in the relativistic case, 


1 1 
| iV + A@)| ao) = Fea (4.4.6) 
Similarly, for the heat kernel at t > 0, 
ere. Se Gey) (4.4.7) 
and 
JetVH4@l (x, yy] < et¥-Acx, y). (4.4.8) 


In the case without magnetic fields, the heat kernel can be computed explicitly 
[118, Sects. 7.9 & 7.10]. In the non-relativistic case, it is given by 


eA(x, y) = exp (—|x — yl?/(4r)), 


1 
(4rt)4/ 


whereas in the relativistic case it is 
t 


eV-Ay, y) =P 4 )eer 
(P+ ie =y/? 


4 


aia 


Proof. By an approximation argument [167, Theorem 15.4], it is enough to 

consider the case A € C!. Thus, we can apply Kato’s inequality in Lemma 4.1. 
The Green’s function 

1 


A = 
oe Cay ada ae 


(x, y) 
satisfies the differential equation 
(-iVx + A(X))’GE(, y) + eG2(x, y) = (x — y) 


in the sense of distributions. If we multiply this equation by sgn(G4) and use 
Kato’s inequality (4.4.2), we obtain 


(—A, + e) |GA(x, y)| < d(x — y)Rsgn(GA(x, y)) < d(x —y). (4.4.9) 


In particular, multiplying Eq. (4.4.9) by G_(y’, x) and integrating over x, this 
yields the desired inequality 


IGA(y’, y)| < Gely’, y). 


In order to prove the corresponding result in the relativistic case, we first note 
that the diamagnetic inequality for the resolvent implies a corresponding result 
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for the heat kernel. More precisely, since 


_B : ( n ) 
e” = lim 
n>oo\n+B 


the inequality (4.4.5) implies (4.4.7) for any t > 0. This diamagnetic inequality 
for the heat kernel can then be extended to the relativistic case, using the fact 
that 


CO 
ote — ; a PIP 4a). (4.4.10) 
0 


This formula implies (4.4.8). From this bound one obtains the result (4.4.6) for 
the resolvent by integrating over f, using that 


[o,@) 


1 
—= foe 
Y 


0 


for Y > 0. We omit the technical details. |_| 


A natural way to view these domination questions is by means of functional 
integrals, which can be defined for (— A)* for 0 < s < 1. A very good discussion 
of this is in Simon’s book [167]. 


4.5 Appendix: An Operator Trace Inequality 


The following inequality about traces of operators is needed in order to bound 
the traces of the Birman—Schwinger kernel that appear in the proof of the LT 
inequalities. This inequality was originally proved in [135, Appendix B] with the 
aid of a sophisticated theorem of Epstein [46]. Earlier, Seiler and Simon [160] 
had proved the theorem using interpolation techniques when H = L?(R®), A is 
a convolution operator and B is a multiplication operator, which is the case we 
actually need for application to the Birman—Schwinger kernel. 

Subsequently, Araki [4] found a proof that was a bit longer but which used only 
well known facts and which, therefore, can easily be given in a self-contained 
fashion. We prove the theorem here using Araki’s method and refer the reader 
to [4] for a generalization of the theorem. 
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Theorem 4.5 (Traces of Powers). Let A and B be positive, self-adjoint opera- 
tors on a separable Hilbert space H. Then, for each real number m > 1, 


TP CARY = Tr BAB, (4.5.1) 


Proof. Step 1: We will need the following little proposition. Suppose that a = 
{a, > a2 > a3 >...} and b= {bi > bo > bz > ...} are two infinite ordered 
sequences of real numbers satisfying the condition a < b , meaning that 


Sas) & for ay 2.3, 53 (4.5.2) 
j=1 j=1 

Then 
>i et < Soe for n=1, 2, 3,.... (4.5.3) 


Remark 4.6. Inequality (4.5.3) can be rewritten in the following way: If A and 
o are two (decreasingly) ordered sequences of non-negative numbers then 


[ [oi < []4; for alln = 1, 2,3,... 
j=! j=l 


==) os ) Ap mor all ta 1, 2. Sigs (4.5.4) 
j=l 


j=l 


Remark 4.7. This proposition holds with e* replaced by any monotone- 
increasing, convex function. It was first proved by Hardy, Littlewood and Polya 
in 1929 [85], but is often known as a theorem of Karamata, who proved it inde- 
pendently in 1932 [98]. A nice proof, using ‘linearization’ is in [11, Sec. 28-30]. 
Another useful reference is [90]. The proof given here also works for all mono- 
tone convex functions and is close to that in [98]. 


To prove (4.5.3) we write e* = Pe: — y),e’dy, where z, := max{z, 0}. We 
then see that (4.5.3) will follow for the function e* if we can prove the ana- 
logue of (4.5.3) for the monotone-increasing, convex function x bh (x — y)4, 
for each fixed real number y. But this is easy to do. We note the obvi- 
ous fact that either Via; -y,= ya — y) for some k <n or else 
Via — y),4 =(a; — y), = 0. The case ee Ce — y), = 0 is trivial and 
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otherwise 
n k k k n 
V@-y+=)@-yv< Le-y< UG -y. < Lei-ys 
j=l j=l j=l j=l j=l 
(4.5.5) 
which proves (4.5.3). 
Step 2: Define the positive operators 
X = B™?A™ Bm? and Y =(BY*A BM*)". (4.5.6) 


We let i denote the eigenvalues of X in decreasing order (followed by an infinite 
string of zeros in case our matrices are finite dimensional). Similarly o are the 
eigenvalues of Y. Clearly, inequality (4.5.4) will prove our theorem if we can 
show that []_, 4; = []/-, 0; for every n > 1. To do this, consider the n-fold 
antisymmetric tensor product /\" H, and the operators A = X @ X @---@X 
and 2 = Y @ Y @---@Y acting on this space. The /argest eigenvalue of A is 
exactly what we are looking for, namely Tha Aj, and that of & is TTie1 a;. To 
conclude our proof, therefore, we have to show that the largest eigenvalue of A 
is not less than that of X. 

Fortunately, A and & can be written in another way. Define a = A @ 
A®---@®Aand B= B@B®@---@B. Then, since ordinary operator prod- 
ucts and tensor products commute, we have that 


A= pram p™? and Y= (Bp /7ap'/)”, (4.5.7) 


which is the form of the operators in our proposed inequality about the largest 
eigenvalues for the particular case n = 1, except that the operators are different. 
That is to say, A is replaced by a, B is replaced by 6 and the Hilbert space is 
replaced by /\" H. 

With A and & as given in (4.5.7), for unknown, but positive operators a and 
B, we only have to prove that the largest eigenvalues e, and ey satisfy e, > ey. 
Let us assume (after scaling) that e, = 1. This means that, as an operator, A < I. 
Our goal will be achieved if we can then infer that & < I. 

Now A <I implies that aw” < B~”. Taking the m"™ root (for m > 1) is 
operator monotone,” i.e., a" < B~™ implies that a < B—!'. We then have that 


'0 Note that although taking a root is operator monotone, taking powers > 1 is not, in general. 
The fact that x > x? is operator monotone for 0 < p < 1 follows easily from the integral 
representation x? = 2 ~! sin(pz) ig tP(t7! — (t + x)7dr, together with the observation that 
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B'? ap? < p'/?p-'B!/? — 1. Hence, © = (B'/2aB!/*)” < I, and our proof is 
complete. | 


Remark 4.8. We have actually proved more than (4.5.1), which refers to the 
sum of all the eigenvalues. We have proved that the inequality holds for the 
largest eigenvalue and for the sum of the n largest eigenvalues of each operator. 


x t+ —1/x is operator monotone. We refer to [17, 42] for more information on operator 
monotone functions. 


CHAPTER 5 


Electrostatic Inequalities 


5.1 General Properties of the Coulomb Potential 


The interaction energy of charged particles is described by the Coulomb poten- 
tial, and the proof of stability of matter depends to a large extent on having good 
estimates for the size of this interaction for systems with many particles. 

From this point on, until the end of the book, we will restrict our attention to 
the physical case of R*. Many of the results have a natural extension to different 
dimensions, but we shall not explore them here. 

Charge distributions may be continuous or discrete and hence it is convenient 
to consider them as Borel measures.! The potential function ® associated with 
a Borel measure ju is defined by 


1 
ox) = | p(dy). (5.1.1) 
ss Ix — y| 


For a positive measure jz this expression is always well defined in the sense that it 
might be +oo. If jz is a signed Borel measure, i.e., 4. = o — t with o, T positive 
Borel measures, the potential may not be well defined because of cancellation 
problems. If, however, we assume that the positive measure v = o + T is finite 
in the sense that 


1 

—— v(dx) < oo, 5.1.2 

law 65.1.2) 
R3 

it is not hard to see that under such an assumption the potential ®(x) is finite 

almost everywhere, in fact it is in Li (R?) (see [118, Chap. 9]). Note that 

any signed measure can be written as the difference of two positive measures 


' For readers unfamiliar with measures, just think of j(dx) as @(x)dx where g is some function 
in L}(R?). See [118, Chapter 1]. 


loc 
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with disjoint support. We shall not use this fact, which is known as the ‘Hahn 
decomposition’, in any way. 
The Coulomb energy D(,1, j1) of a — distribution jz is defined by 


pee >| /— yen utay). (5.1.3) 
ae 


Again, this expression is well defined in the case where yu is positive. In the 
following a charge distribution will be a Borel measure 4 = 0 — t where both, 
o and T are positive measures satisfying the condition (5.1.2) and, moreover, 
both have a finite Coulomb energy. We do not require that o and t have disjoint 
support. The following theorem shows that the expression (5.1.3) is well defined 
for charge distributions. 

The measures we are mostly concerned with are absolutely continuous, that 
is, they are of the form u(dx) = o(x)dx for an integrable function g. In this 
case, we shall slightly abuse notation and use the symbol 


1 
D(oe, 0) = 5 i ee ax dy (5.1.4) 


RxR 
for the Coulomb energy of the corresponding measure. 
Quite generally we define the interaction energy of two charge distributions 
LL, v by 


1 1 
D(w, v) = 5 // 7 yee (5.1.5) 


R? xR? 


Theorem 5.1 (Positive Definiteness of the Coulomb Potential). Assume that 
jis a charge distribution as defined above. Then 0 < D(w, 4) < 00. Moreover, 
for any two charge distributions [1 and v 


Dw, vy < Du, w)D(, v). (5.1.6) 
Proof. A calculation (see, e.g. [118, 5.10(3)]) shows that 


1 1 1 1 
——— ———__- ———-dz 
Ix—y| wes |x —z\? ly—2/? 
R3 


(5.1.7) 


(The fact that the two sides of (5.1.7) are proportional to each other follows 
from the observation that they are both functions only of |x — y| and both are 
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homogeneous of degree —1; the only thing that has to be worked out is the 
constant 2~>.) Hence, for any charge distributions ju, 


2 
1 1 
D(u, L) = =| / —— (dx) | dz > 0. (5.1.8) 
2r3 lx =| 
R3 R3 
It is easy to see that D(wu, “) < 6, since D(o,o) and D(t, T) are finite by 


assumption. This fact, as well as inequality (5.1.6), follows from Schwarz’s 
inequality and the representation 


Dii.v)= 54 =| [— Paes ) Jee pide) dz. (5.1.9) 


R3 R3 
a 


Another fact about the Coulomb potential, which will be of great importance, 
is Newton’s theorem. It states that outside a charge distribution that is rotation- 
ally symmetric about a point x9, the associated potential looks as if the charge is 
concentrated at the point x9. Recall that a measure yp is rotationally symmetric 
with respect to a point x9 if for every measurable set A and any rotation R which 
leaves Xo fixed, 


W(RA) = WA). (5.1.10) 


Theorem 5.2 (Newton’s Theorem, [144, Thm. XXXI]). Let uw be a charge 
distribution that is rotationally symmetric with respect to the origin. Then 


1 1 
P(x) = — / u(dy) + i — (dy). (5.1.11) 
ren ae 
yl S|x yl>|x 


Consequently, if, for some R > 0, w({y : |y| > R}) = 0 then 


O(x) = eer) for |x| > R. (5.1.12) 


Likewise, if u({y : |y| < R}) = 0, then 
1 
(x) = constant = / iy he? for |x| < R. (5.1.13) 


lyl>R 
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Proof. Since the measure yp is rotationally symmetric with respect to the origin, 
so is the potential ®, i.e., O(x) = O(y) whenever |x| = |y|. Thus, ®(x) = 
(®)(x) where (®)(x) denotes the average of ®(|x|@) over the unit vector w. 
Using (5.1.1) this requires that we calculate the integral 


1 
l=” (5.1.14) 
Z \|x|@ — y| 


where dw is the uniform normalized surface measure on the unit 2-sphere. By 
rotation invariance we can let the vector y point towards the north pole. Resorting 
to polar coordinates, one is led to the calculable integral 


1 
| : d { Pe | (5.1.15) 
5s =min{—, —}. Al. 
2 , (Ix |? + ly/? — 2|x||y|s)!/? Ix] |y| 


Formula (5.1.11) now follows. | 


5.2 Basic Electrostatic Inequality 


When estimating the Coulomb potential of a collection of nuclei and electrons 
one faces two essential difficulties. The first is that the potential is singular at 
the location of the nuclei. The second, more serious one, is that the potential 
can also be large away from the nuclei because there might be many nuclei. The 
following electrostatic inequality, proved in [138, Lemma 1], disentangles these 
two issues. It suffices for several of the proofs of stability in this book, but a 
more sophisticated version turns out to be necessary for some stability results 
concerned with the quantized electromagnetic field. That version, which uses 
this basic version as input, will be given in Section 5.4. 

To describe the inequality in detail we need the notion of a Voronoi cell with 
respect to a collection of nuclei of equal charge Z located at points R;. Let 
R,,..., Ry be M distinct points in R?. Define the cell 1; C R? by 


r,={xe R?: |x — R;| < |x —R;|, for alli F j}. (5.2.1) 


Clearly I’; is open and it is easily seen to be convex. Its boundary, dI’;, is a finite 
collection of segments of planes and, possibly, the point at infinity. 
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Denote (half) the nearest neighbor distance of the nucleus located at R; ¢ T; 
by 
Ms 
Dj:= oo —Rj|. (5.2.2) 


This is the distance from Rj; to dI’;. The Coulomb potential felt by an electron 
at the point x, generated by M nuclei of equal charge Z, is the negative of 


1 
W(x) = oR rey 7 =a 


which is a harmonic’ function on the complement of all the R ;. In the following, 

it will be important that all the nuclear charges be equal. Concerning the question 

of stability of matter, this is not an important restriction, as noted in Section 3.2.3. 
Define 


D(x) = min{|x — R;|:1<i< M}, (5.2.3) 
and set 
D(x) = W(x) — = 5.2.4 


In other words, for x € I";, the number —®( x) is the value of the potential at the 
point x due to all the nuclei outside the Voronoi cell T’;. It is important to note 
that ® is a continuous function on R* (here we use the fact that all the nuclei 
have the same charge Z). Although ® is harmonic in each Voronoi cell, it is 
not harmonic on the whole of R? since ® is not differentiable on the boundary 
of the Voronoi cells. Its derivative makes a jump as we go from one cell to a 
neighboring cell. 

An obvious connection between ® and the Coulomb repulsion energy of 
nuclei of equal charge Z is 


2 
Z > RR a ma 32 HR). (5.2.5) 


Our main concern is to estimate the setiial energy of an electron, taking 
into account all the other electrons and all the nuclei — except for the nucleus 
in the cell in which the electron finds itself. The Coulomb repulsion among 
the nuclei is included. The following theorem gives such an estimate. We will 
utilize it later by essentially identifying the arbitrary measure jz that appears in 
the theorem with the (smoothed) charge density of all the electrons. 


? A harmonic function g is one satisfying Ag = 0. Equivalently, the average of g over any sphere 
equals the value of ¢ at the center of the sphere. See [1 18, Chapter 9]. 
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Figure 5.1: Example of Voronoi cells in two dimensions. 


Theorem 5.3 (Basic Electrostatic Inequality). For any charge distribution 
= by — w- with D(z, U+), DQu_, “_) < 06, and for any set of distinct 
points R,,..., Ry € R’, 


Diw.u)— f exude) +P Se (5.2.6) 
ee LL DR RI 82D, Zz 


where ® is defined in (5.2.4) and Dj in (5.2.2). 


Proof. Our first goal is to find a charge distribution, v, that generates the potential 
®. That is to say, 


D(x) = he 7 ree 2 (5.2.7) 
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or, equivalently, 
—A®=Anv. (5.2.8) 


To this end, pick an infinitely differentiable function f of compact support, and 
calculate 


[ ecarenar = >| O(x)A f (x)dx 
a 


= » / div (®V f)(x)dx — al VO(x)- V f(x)dx 
J rT; J T; 


=. / D(x); - Vf (x)dS — ~/ VO@(x)- V f(x)dx, 
j it, 


J ar; 


where fi; is the outward normal to dl’; and dS denotes the two-dimensional 
Euclidean surface measure on dI’;. Since ® and V f are continuous the sum of 
all of the boundary integrals is zero. Furthermore, 


->| V f(x): V®(x)dx = - > fair vencnae + y f(xX)AP(x)dx 
in; in; in; 


= > / div( f V®)(x)dx (5.2.9) 


Try 


since ® is harmonic in I";. Hence we have 


/ O(x)Af(x)dx = ->— / f(x)aj - VO(x)dS. (5.2.10) 


R3 J ar; 


Note that this expression does not vanish since V ® is not a continuous function. 

The boundary dI°; consists of two-dimensional planar segments each sepa- 
rating I"; from some other Voronoi cell I’; (and possibly the point at infinity, 
which we can ignore for the present since f has compact support and hence all 
integrands have compact support). Note that on the right side of (5.2.10) each 
boundary segment appears twice, once as the boundary of a Voronoi cell and 
once as the boundary of its neighbor. The outward normals, however, point in 
opposite directions. Therefore, if we insert (5.2.4) in the right side of (5.2.10) 
we see that the contribution of W(x) for each segment cancels since W(x) is 
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a differentiable function away from the Rj;, and hence is differentiable in a 
neighborhood of dI°;. Thus (5.2.10) becomes 


: Z 
i, O(x)Af(x)dx = a / f(x)nj; - ‘ay (5.2.11) 
R3 J ar; 

Choose any two Voronoi cells I’; and I’, that share a planar segment. Since 
every point on this segment has the same distance to the point R; and R; we 
learn that the gradients of D(x)~', one taken from the interior of I; and one 
taken from the interior of I, and then evaluated on the segment, have the same 


magnitude but opposite orientation, i.e., 


1 1 
a; -V —— =’ -V—_ 
|x — Rj | |x — R;| 


for all x on the segment. This allows us to rewrite (5.2.11) as 


1 
/ O(x)Af(x)dx = 22 / f(x)nj- Veena (5.2.12) 
xX —" . 
R3 Ujar; z 
It follows from this formula that the measure v that generates ®(x) in the sense 
of (5.2.7), (5.2.8) is a surface charge density concentrated on the planar part of 


U;dI;. On the segment joining the Voronoi cells I’; and I’; it has the magnitude 


Zz. 1 iz 1 


2a ° |x = R;| 20 |x = R;| 


Moreover, v(dx) is positive, which follows from the fact that the Voronoi 
cells are convex, and hence f; - (x — R;) = 0 for all x ¢ dI’;. Thus we have 
achieved our first goal, to identify the measure v in (5.2.7), (5.2.8). 

For any measure j4(dx) we can write 
2 


D(w, — | ® d —____. 
w.n)— f cota) + Te ey 


7 
= D(u—v, w—v)— Div, v) + }> ———_ 
op Re — Ril 


72 
k<l 


IV 


since D(u — v, 4 — v) > 0 by Theorem 5.1. 
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It remains to calculate D(v, v). First, note that 


way =z) f ay R; 


J R3 


where 6 is the Dirac delta-function, so that (5.2.7) implies that 
= W(x)v(dx) = Z >), @(Rj). We use this fact in the following way. 
With the aid of (5.2.5), 


D ® d W dz) = — J = = 
(vy, v) = >| (x)v(dx) = >| (x)v(dx) afar x) 


oe )- fa" v(dx) 


2 
= d. 
= RR a i 


Next, 


1 Z I , 1 
= v(dx) = -yo= ———a; - V—_dS, 
E- j BT Ix — Rl ° |x — Rj| 


which, by a straightforward calculation, equals 


Lief =e Tif * = ‘aad 


where I“ is the complement of I’;. Note the change of eal Since 


1 2 


Ix -—R,\? |x -—R,|*’ 


we need only calculate a lower bound to the integral 


a / : d. (5.2.14) 
Bo 2, 
x J |x —R,\* 
ry 


This is an integral over the complement of the Voronoi cell I';, which is a 
set containing the half-space whose boundary plane touches the ball of radius 
Dj; centered at Rj; (in fact the boundary of this half-space contains the planar 
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segment separating I"; from its nearest neighbor). Thus we get a lower bound 
by just integrating (5.2.14) over that half-space. By shifting and rotating 
coordinates we may assume that R; = 0 and that the bounding half-plane is 
parallel to the y—z plane. Hence 


Z? 1 a ae | 1 2? 
dx > dz / d / dx = 7 
eo =| y+ y+ e2P 8D; 
Mr —0o —0o D; 
which proves the theorem. | 


5.3. Application: Baxter’s Electrostatic Inequality 


The basic inequality, Theorem 5.3, will now be applied to proving an inequality 
of Baxter [10] that was originally proved very differently using some sophisti- 
cated concepts of potential theory.° The derivation from Theorem 5.3 given here 
is due to Solovej (private communication). 

In order to bound the Coulomb potential Vc(X, R) from below, we would like 
to apply Theorem 5.3 to a measure that is appropriate to N electrons, namely 
(dx) = yy d(x — x;)dx. Inequality (5.2.6) is of no use in this case, however, 
because of the infinite self-energy of jz. What we are after here is an inequality 
of the form (5.2.6), without these infinite self-energy terms. This is the content 
of the following theorem. 


Theorem 5.4 (Baxter’s Inequality). Let V-(X, R) be the Coulomb potential 
of N electrons with coordinates x; € R? and M nuclei with common nuclear 
charge Z and with coordinates Rj € R?, as given in (2.1.21)-(2.1.24), namely 


Z Z 


i N M 
eee cer 
7 |x — || 2 RR 


V(X, R= > mom 


1<i<j<N i=1 


=! 


— 


(5.3.1) 


As before let T ; denote the Voronoi cell belonging to R; and let D(x) denote the 
distance from x to the nearest nucleus; if x € Tj then D(x) = |x — R;|. With 


3 Baxter’s inequality in [10] did not have the positive term on the right side of (5.3.2). 
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D; half the nearest neighbor distance defined in (5.2.2), there is the inequality 


a ee 
V(X, R) = ee a B (5.3.2) 


J 


j=l 


Remark 5.1. This theorem says, in effect, that for a lower bound the electrostatic 
interactions can be thought of as canceling each other except for the residual 
interaction of each electron with its nearest nucleus. Note that (5.3.2) gives a 
lower bound on the many-body potential Vc in terms of a one-body potential 
for the electrons. The striking feature of this inequality is that (NV + M)? terms 
on the left side are bounded by only N + M terms on the right. A defect of this 
theorem is that the Coulomb singularity on the right side has a weight 2Z + 1 
instead of the expected Z. For non-relativistic matter, where stability holds for 
all w and Z, this defect only amounts to a modified numerical constant in the 
energy estimate, but for relativistic matter, where there is a critical value of 
aZ <2/z for stability, the factor 2Z + 1 will yield an incorrect value for the 
critical value of a Z. This defect will be remedied in the refined inequality in the 
next section. 


Proof. For simplicity, denote d; = D(x;). Let u;(dx) = (d?x)~'8(|x —x;|- 
d; /2)dx be the normalized uniform measure supported on a sphere of radius 
d; /2 centered at x;, and let uw = Ee L4;. (Note that we use d; /2 and not d;.) If 
we replace the electron point charges by the smeared out spherical charges ju;, the 
electrostatic interaction among the electrons is reduced because the interaction 
energy between two spheres is less than or equal to that between two points. 
This follows from Newton’s Theorem, which also implies that the interaction 
between the smeared electrons and the nuclei is not changed, since the radius 
d; /2 is less than the distance of x; to any of the nuclei. Hence 


M 2 
Z Z 
Vc(X, R) = 5 2D(ui, Mj) — ys ‘| ix Rte + (R, — Ril 
J=lp3 k<l 


1<i<j<N 


= vA vA 
= piu.n)— > [ mA Fin > mom. 
J=lp3 : J 
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We have used that D(ju;, 14;) = 1/d;. We shall show that the right side of (5.3.3) 
is bounded from below by the right side of (5.3.2). 
An application of Theorem 5.3 yields the lower bound 


x i), 24 
V(X, R)>- >> | 305 rata Tg, re (5.3.4) 
J 


i=1 j=l 


To finish the proof it suffices to show that for x in the support of j4;, we have 
D(x) => d;/2. This follows from the triangle inequality, which implies that for 
any k 


|x — Ry| > |x; — Ry| — |x; —x| = dj —d;/2 = d;/2. (5.3.5) 


In the last step, we used that |x; — R,| > d; by definition, and |x; — x| = d;/2 
for x on the sphere centered at x;. | 


5.4 Refined Electrostatic Inequality 


As remarked in the previous section, the lower bound in Theorem 5.4 has the 
virtue of giving a lower bound to the many-body Coulomb potential Vc in 
terms of a one-body potential for the electrons, but it has the defect that the 
Coulomb singularity has the wrong prefactor 2Z + 1 on the right side, instead 
of the desired factor Z. This is of crucial importance when studying the stability 
of relativistic matter. The following theorem, which remedies this defect, was 
proved in [138, Thm. 6]. The price one has to pay is a slightly more negative 
one-body potential in the Voronoi cells I";, but we emphasize the fact that the 
singularities at the nuclei are unchanged, namely Z/|x — Rj|. 


Theorem 5.5 (Refined Electrostatic Inequality). For0 < 2 < 1 let W* be the 
function on R? whose value in the Voronoi cell I’; is defined to be 


Z yD; — Dy*|e— RiP)! for |x — Ril < AD; 
lx — Rj (/2z + +) Ix — R;\7! for |x — Rj| > 4Dj, 
(5.4.1) 
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with D ; defined in (5.2.2). Then, with X = (x1,...,Xn)and R = (Rj), ..., Ry) 
as before, 


i=1 


N ; vA M 1 
V(X, R= -)IWeD+> Da (5.4.2) 
jai. 


Remark 5.2. The optimal choice of ) in (5.4.1) that makes W* as small as 
possible is independent of X and R and depends only on Z. To see this, note 
that the upper choice in (5.4.1) is increasing in |x — R | and the lower choice is 
decreasing. Hence the optimal A is determined by setting the two terms equal, 
that is, 


1 V¥2Z+1/2 
21 —A2) — rN , 


(5.4.3) 


In other words, one can take the minimum of the two choices in (5.4.1) and forget 
about A. (When |x — R;| > Dj, but still in the Voronoi cell I’;, the first choice 
has to be interpreted as +-oo when taking the minimum.) When |x — Rj| => Dj;, 


W(x) = (WZ +: 1/72)?/|x — Ril. 


Proof. In order to be able to employ Theorem 5.3, we shall first smear out the 
electron charges in a definite (possibly non-spherical) way. This will decrease 
the contribution of the electron—electron repulsion to the Coulomb energy, but 
increase the electron—nuclear attraction. The measure (dx) will be taken to 
be the sum of all the smeared charges of all the electrons. The term D(j, 2) 
will underestimate the interaction of two electrons, which is fine, but it will also 
include a self-interaction term for each electron, which is not fine and which 
will have to be subtracted. These subtractions form part of the terms in (5.4.1). 
Another part of these terms arises from the fact that f (x) (dx) underestimates 
the electron—nuclear interaction. 

More precisely, we shall replace the point charge at x; by a charge distribution 
v; which is defined as follows. We distinguish two cases, depending on whether 
the distance of the electron x; to the nearest nucleus at R; is bigger or smaller 
than ADj;. 


Case 1. If |x; — R;| < 4Dj, the measure v; is supported on S;, the sphere 
of radius D; around R;. Note that by definition S; lies in the Voronoi cell 
I';. The charge v; is chosen in such a way that the potential created by v; 
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equals |x —x;|~! for all x with |x — R;| > Dj, that is, for all x outside S;. 
This property determines v; uniquely. Moreover, v; is a positive measure. This 
follows from the fact that the function x 1 |x — x;|~! — ie ly —x|~!v,(dy) is 
superharmonic’ inside 5; and vanishes at S;; hence itis a non-negative function 
whose normal derivative, which equals the surface charge by Gauss’s law, has 
to be non-negative.” 

The potential /’ 5 |¥—* |~'v,(dy) generated by v; can be evaluated explicitly. 
For |x — R;| > Dy itis |x = x;|~!, while for |x — R;| < Dj it is given by 

D; 1 


J 
5.4.4 
iz — Ryle — 2" one 


where xi — Rj = (x; — Rj)D¥/\x; — R;(’. The point x} is in fact obtained 
from x; by spherical inversion. That is, the potential inside S; is the same as 
the potential created by a point charge located at x7 outside S$; with charge 


Dj/|x; — Rjl. 
The self-energy of v; is then easily computed to be 
1 D; 1 1 
Dj, Vj) = (5.4.5) 


~ 2\x; —xF| |x; —Rjl 2D; 1- |x; — Rjl?/D? 


Case 2. Ifx; ¢ Ij and |x; — R;| > 4Dj;, we choose S; to be the sphere centered 
at x; with radius |x; — Rj;|/¢ for some ¢ > 1 to be determined later. Note that 
S; does not necessarily lie entirely inside .;. The measure v; is again supported 
on S;, but this time with uniform surface charge distribution. Its self-energy 
therefore equals 


4 


Dy, vi) = -——_|. 
a ee a 


(5.4.6) 


In both cases, v; is a positive measure, and 
v;(dx) Z 1 
Ix—y| ly — xi | 


i 


4 A function f is superharmonic if Af < 0 in the sense of distributions. It is a theorem [1 18, 
Thm. 9.3] that this is equivalent to the statement that for all x, f(x) is greater than or equal to 
the average of f over any sphere centered at x. 

> In physics this charge distribution is called the screening charge. It is the charge on a perfectly 
conducting grounded sphere that is induced by a unit charge located at a point x; inside the 
conductor. In mathematics it is known as the harmonic measure corresponding to the point x;. 
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for all y € R°. Using this twice, 
2D(0, vj) = ff MEY = f UO” 


Ix — y| Ix; —y| ~ |x; — x; 
SixS; S; 


In particular, 


|X; =; ~ 


a ee D(Dyw, Livi) — P| DO, wa. (5.4.7) 


i<j 


The self-energy terms are given in (5.4.5) in case | and in (5.4.6) in case 2, 
respectively. 
We now apply Theorem 5.3, which states that 


ae . od 
D(Syvis Ns) + 5 = = | O@) ide) + = YS — 
(Sov i) + ; |R;, — R| é . (x)vi( y+ 8 D; 
< f= 8) . 


j=l 


with ® given in (5.2.4). In order to calculate ‘i s, P(x) vi(dx) we again have to 
distinguish the two cases given above. 
Case 1. In this case, §; is entirely contained in some I";, and hence 


Z 


| P(x)vj(dx) = hee Som 2 ee 


kK, kA; § kK kAj 
since all the R, for k # j are located outside S§;. 


Case 2. Writing ®(x) = W(x) — Z/D(x) as in (5.2.4) and using the fact that 
all the R, lie outside S; by construction, we have 


es Z v;(dx) 
/ @(x)v;(dx) = > ye “| ay 


Recall that D(x) is the distance to the nearest nucleus defined in (5.2.3). Assume 
that x; € I’;. We claim that for all x € S; and all k 


[Ry —x| > Rj-xl(1-2). (5.4.8) 


This follows from the triangle inequality in (5.3.5), |Ry —x| > |Re —-xi| - 
|x —x;| = |Ry —x;| — |x; — R;|/¢. The fact that |R;, —x;| => |R; —x;|, by 
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definition of R; as the nearest nucleus, yields (5.4.8). In particular, O(x) = 
|R; —x;| (1 - 1), and hence 


[ eenan = Ee (5.4.9) 
2 —————————— —— A. 
J [Re=— xo —1 EAI |x; — Rx| 


The last term is the desired interaction energy with all but the nearest nucleus, 

and the first expression is an error term. 
Altogether, we have shown that 

Z Zz 


D(x; = 8 


M 
J= 


~ = ye F*(x;), (5.4.10) 


1 J i=l 


where F* is given as follows. In case 1, i.e., when x € I’; and |x — Rj| < AD;, 
we see from (5.4.5) that 


| roe 2 7 
F(x) = 5D "(1 — D7*|x — Rj?" 


In case 2, when x € I’; and |x — Rj| > ADj;, it follows from (5.4.4) and (5.4.9) 
that 


1 
F*(x) = ——— (¢/2+ Z/(¢ —1)). 
|x = R;| 


In order to minimize this last term in case 2, we choose ¢ = 1+ V2Z. This 
yields (5.4.2). | 


CHAPTER 6 


An Estimation of the Indirect Part of the 
Coulomb Energy 


6.1 Introduction 


In this chapter we shall bound the difference between the Coulomb energy 
of a system of many electrons in a state w and the electrostatic energy of 
the corresponding charge distribution o,,. This difference is also known as 
the indirect part of the Coulomb energy. Such a bound is not only rele- 
vant for the discussion of stability of matter, but is of importance in other 
areas such as density functional theory in quantum chemistry [51]. While the 
results on this chapter will be used in one of our proofs of stability of non- 
relativistic matter in the next chapter, they are not strictly needed for the proof, 
and we shall give a different proof avoiding their use. A reader who is only 
interested in the quickest proof of stability can skip this chapter. The results 
will be of importance in the discussion of relativistic matter in Chapter 8, 
however. 

As explained in Chapter 3, a quantum mechanical system of N particles has a 
state which, generally, is described by a density matrix. For our purposes here, 
only the (spin summed) diagonal part of this density matrix is relevant. Also, 
statistics does not play any role in this chapter; for our purposes here it does not 
matter whether we are dealing with Bose, Fermi or mixed statistics. In fact, the 
exchange energy defined below depends only on the N particle density (spin 
summed, in case of spin). 

We denote the N particle density as Py(x,,...,xXy). It should be thought of 
as |y(x1,...,xXy)|° or, more generally, as )°, |W(x1,01,...,XN, oy)|* in the 
case of non-zero spin, but that does not matter. The function Py is non-negative, 
and we assume it to be normalized, as usual, as 


[Preece ewes dey = 1 (6.1.1) 


R34 
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For particles with charges e;, 1 < i < N, the electrostatic energy is defined by 


Py(X1,---, 
-»> ves | ENO SUN oye (6.1.2) 


ope xX 
1<i<j<N il 


where e; is the charge of particle i; it is not assumed that all the e; are the 
same, or that Py is a permutation symmetric function of the x;. For studying 
the question of stability of matter, it is desirable to have a lower bound on /p in 
terms of the single particle charge density, which is defined for each x in R? by 


N 
O(x) = > Oi(x) (6.1.3) 


i=1 


and where the charge density of particle i is given by 


Oi(x) = e; ; PHM og Rite He Rapes pe dep dey, 
R3-) 


(6.1.4) 


As usual dx; means that the x; integration is omitted. Observe that Q;(x) is a 
non-negative function with integral 


/ Qi(x)dx = e; (6.1.5) 


(because of the normalization (6.1.1)). Hence the total charge is given by 


N 
/ O(x)dx = >a. (6.1.6) 
R3 i=l 


The electrostatic energy associated with the charge density Q(x) is given by 
(compare with Eq. (5.1.3)) 


D(Q, Q) = 


>| QO) aay 6.1.7) 


Ix — y| 
oe 
and is called the direct part of the Coulomb energy. It is the classical Coulomb 
energy associated with a ‘fluid’ of charge density Q. Since Q(x) > 0, D(Q, Q) 
is always well defined in the sense that it is either finite or +-oo. Accordingly, 
the indirect part of the Coulomb energy, denoted by Ep, is defined by the 
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equation 
Ip = D(Q, Q) + Ep. (6.1.8) 


Thus, Ep is the difference between the true energy /p and the classical approxi- 
mation D(Q, Q). Sometimes it is called the exchange plus correlation energy. 
It is the aim of this chapter to give a lower bound on Fp in terms of Q. We 
emphasize again that our bound on E'p holds for arbitrary normalized N-particle 
densities Py. 


6.2 Examples 


The first example comes from Hartree’s theory. Consider N spinless particles 
(i.e. g = 1), each with the same charge e. Assume that they are not correlated, 
in which case their wave function is given by a simple product 


w(X1,...,Xw) = filt1)-:- fu(ewn), (6.2.1) 


where each f; is in L7(IR*) and normalized. If all the f; are the same, this would 
describe bosons, but this is not important. The N-particle density in this case 
would be 


Py(x1,.--,Xv) = |W... ew = IAD? Lfv@wl?. 6.2.2) 


A simple computation yields 


N 
Q(x) =e) >| fi(x)I? (6.2.3) 


i=1 


and 


N 
Ip = D(Q, Q)-e? )) DUAL AD. (6.2.4) 
i=l 
Hence Fp is the (negative) sum of the self-energies of the charge distributions 
e| fi(x)|° in this case. 
Another example is provided by a Hartree-Fock wave function. Again the 
charges are all taken to be equal to e. Then w is the antisymmetric function of 
space and spin z = (x, 0) given by a determinant 


W(Z1,--., ZN) = (N!)~'? det(¢;(z;)) (6.2.5) 
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where the functions ¢; are orthonormal in L?(R*; C4). Using (3.1.28) and the 
equation (3.1.29) for the two-particle density, one sees that for the corresponding 
N-particle density 


Py(¥1j4004%N) = Solve, seater 


the indirect part of the Coulomb energy is 


(1) 
-5¢ ff Oe. al ASR (6.2.6) 


where 


N 
YM, 2) = D> GDR) (6.2.7) 


i=1 


denotes the one-particle density matrix of yr. In this context E’p is called the 
exchange term. 

An approximation to Ep in terms of Q was computed in [40] using perturba- 
tion theory, i.e. using the eigenfunctions of the kinetic energy operator (i.e. the 
Laplacian) in a large cubic box A. One chooses the ¢;(z) to be a product of spin 
and space wave function, i.e., 


buk(Z) = Xa er. (6.2.8) 


Tm 


where x_(o) with a = 1,...,q is an orthonormal basis in C7, |A| denotes the 
volume of A, and the allowed values of k are 


_ n . 3 
= Tale withn € Z;. (6.2.9) 


A dimensional argument immediately shows that for |A| and N large, and with 
o = N/|A| fixed, 
Ep = —Ce?q ‘70% AI. (6.2.10) 


Closer inspection shows that C = 0.93. 
Formula (6.2.10) suggests that in the general case, i.e., for N-particle densities 
coming from antisymmetric functions of space and spin, Ep given by (6.1.8) 
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should be bounded below by 


—Cerg O(x)*7dx (6.2.11) 
R3 


for some suitable universal constant C. For the case where all the e; equal e, 
(6.2.11) is correct provided the factor q~'/? is omitted. That the value of g and 
the Pauli principle play no role in the question of bounding Ep in terms of Q 
alone we shall show now. 

Given any symmetric particle density Py(x1,...,X,y), itis easy to find func- 
tions Wy, € Kn L?(R?; C?) for g = 1, 2,3,... each of which is antisymmetric 
in the z; = (x;, 0;) and such that all the functions yy, have the same N-particle 
density Py. To see this, define a function 6(x,,..., xy) to be antisymmetric in 
the x; and independent of the o;, and to take only the values +1. Consider the 
function 


Wa(Z1,---2n) = qn? Pu(x1, +. XN )O(K1,---,XN) (6.2.12) 


which is antisymmetric and independent of the o;. Obviously we have 


So lUg(Z1, Zw) = Py(x1,...,Xy) (6.2.13) 


for each x;,...,Xy, and for each value of g. Hence any symmetric N-particle 
density Py can come from an antisymmetric wave function with arbitrary value 
of q. 

Thus, the best general estimate we could aim for is 


fp Sate" / O(x)*Pdx (6.2.14) 
R3 

with C being a universal constant. One might try to improve the constant in 
the estimate by excluding certain symmetries for wave functions for example 
symmetric (i.e. bosonic) functions. That this is impossible can be seen in a similar 
fashion as above. Thus the Coulomb repulsion is insensitive to the symmetry 
properties of a wave function and is therefore not able to see the spin. The 
reason q entered in (6.2.11) was that the kinetic energy operator was taken into 
account, i.e., changing g meant changing the ground state of the kinetic energy 
operator and hence it meant changing Q(x). Our point is that when Q(x) is the 
only available information then (6.2.14) is the best one can hope for. This is in 
contrast to the kinetic energy which is sensitive to the symmetry properties of 
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the spatial part of the wave function. In fact, as was shown in Chapter 4, a spin 
dependent bound for the form 


ecg er i; QO(x)?3dx (6.2.15) 
R3 
can be obtained. The difference, i.e., the spin dependence of the kinetic energy 
estimate and the spin independence of the exchange estimate, can be (somewhat 
sloppily) rephrased by saying that only an off-diagonal operator (like the Lapla- 
cian) can ‘see’ the symmetry properties of a wave function and therefore the 
spin. 


6.3 Exchange Estimate 


The main theorem of this chapter is the following lower bound on Ep. The 
charges e; do not have to be equal to each other, but it is important that they are 
all positive or all negative. Note that e; Q; > 0 by definition. 


Theorem 6.1 (Exchange Estimate). Let Py be a normalized N-particle density. 
Assume that the e; have the same sign for all 1 <i < N. Then the indirect term 
Ep of the Coulomb energy given by (6.1.8) satisfies the estimate 


1/2 
N / 


/2 
4/3 
Ee=—c{ [(Seam) a} [( fiowra} . 63.» 
R3 


R3 i=1 
where C is some constant satisfying 
C < 1.68. 


Here Q(x) and Q;(x) are given by the expressions (6.1.3) and (6.1.4). In case 
all the e; equal a common value e then 


Ep> -ciel** [ |O(x) 4/7 dx. (6.3.2) 


R3 


Remarks 6.1. 


(a) This theorem was originally proved in [111] with the aid of the Hardy- 
Littlewood maximal function, with a bound on the constant C given by 8.52 


6.3 Exchange Estimate 111 


(b 


ma 


instead of 1.68. The improved constant was obtained by Lieb and Oxford in 
[125]. Moreover, by numerically optimizing the smearing function jy in our 
equation (6.6.9) below, it was shown in [29] that the Lieb—Oxford method 
presented here can yield C < 1.636. 

Since the number of particles is fixed one might expect that the sharp constant 
in (6.3.1) is N dependent. This is in fact true. In the case of one particle the 
constant C; can in principle be computed exactly. Since 7p = 0 in this case, 
Ep = —D(Q, Q) and we have that 


D(Q, Q) 
f O(x)*3dx 


It is not difficult to see that C is finite (see item (f) below). The existence of 
a maximizer was shown in [125]. The corresponding variational equation is 
the Lane—Emden equation of order 3, which was studied in [179] and [76]. 
In particular, its solutions are tabulated. The result is 


Ci = sup { : O(x) > 0, ; O(x)dx = 7 : (6.3.3) 


C; = 1.092. (6.3.4) 


This constant plays a role in the Chandrasekhar mass limit for gravitating 
bodies (see Chapter 13). The Lane—Emden equation goes back to Lane [101] 
in his study of gravitating gas spheres in the year 1870! 

A lower bound for C2 was computed in [125]: 


C2 > 1.234 > Ci (6.3.5) 
In general it is not hard to see that 
Cy < Cn4i, (6.3.6) 


and we refer to [125] for details. The constant C in (6.3.1), which is valid 
for all particle numbers, is the worst possible case, and we note (from the 
above bound on C2) that the bound 1.68 cannot be improved very much. 


(c) Itis not clear if the (unknown) optimal constant C in the theorem is improved 


by making the assumption that all the e; are equal. Our proof, however, does 
not get simpler or better in the equal charge case. 


(d) A slightly unpleasant feature of the bound (6.3.1) is that it does not just 


depend on the total charge density Q(x), but rather on the individual charge 
densities Q;(x). 


(e) Note that there is no upper bound on Ep in terms of the one-particle density 


Q. Even for a nice, smooth Q, Jp (and hence Ep) can be +00. For example, 
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if P)(x1,x2) = C exp(—|x1|? — |x2|?)/|x1 — x2|? then this is the case. This 
example also shows why it is necessary that all charges have the same sign 
in Theorem 6.1 for, otherwise, Ep could be —oo. 

(f) This last remark is of a more technical nature. Since Ep is the difference 
of two positive quantities and since the only assumption on Py is that it is 
normalized, the reader might worry that E p is not well defined. Conceivably 
Ip and D(Q, Q) could both be infinity and yet Ep, being the difference of 
the two, is somehow finite. This does not affect the validity of Theorem 6. | 
as the following reasoning shows. We can assume that f |O(x)|*/3dx < 00 
for otherwise there is nothing to prove. By the Hardy—Littlewood—Sobolev 
inequality (see [118, Thm. 4.3]) we have that 


D(Q, Q) < CllOllés (6.3.7) 


and by Holder’s inequality 
1/3 1/2 


| Olle/s < / | Q(x) dx / Oe) 43x 
R3 R3 


A 


1/3 1/2 


/ \O(x)|\*7dx} (6.3.8) 
R3 


N 


i=1 


Hence, whenever Q € L*/3(R°) (so that the right side of (6.3.1) and (6.3.2) 
is finite) D(Q, Q) is finite and Fp is well defined. It might be +00, but 
never —Oo. 


The proof of Theorem 6.1 is contained in Sections 6.4—6.6. For simplicity, 
we shall consider the case that all e; > 0 even though the main application of 
Theorem 6.1 is to electrons for which e; = —e < 0. This is done to avoid writing 
absolute values everywhere, but it is of no consequence since the only relevant 
quantities are the products e,e; > 0. 


6.4 Smearing Out Charges 


The first step in the proof of Theorem 6.1 is a generalization of a lemma originally 
due to Onsager [145]. 


6.4 Smearing Out Charges 113 


Lemma 6.1 (Onsager’s Lemma). Consider N positive charges e; located at 
distinct points X,,...,Xy in R?. For each1 <i < N, let [Lx, be anon-negative, 
bounded, function that is spherically symmetric about x;, with + Ly, (x)dx = 1. 
Then for any non-negative, integrable, function @ there is the inequality 


N N 
ee; 
Lae P+ 2D (e:DQ ts, — De? Ditts;s Ha,) 641) 
J i=l i=1 


Proof. The functions jx, being bounded guarantees that D(o, x,) and 
D(tx,, 4x,) are finite. We can assume that D(@, Q) is not infinite, because if 
it were infinite the right side of (6.4.1) would be —oo and the lemma is trivial. 
We know from Theorem 5.1 that D(-, -) is positive definite and hence 


N N 
(0 Seine Yet >0 (6.4.2) 
i=] i=1 


which implies that 


N N 
e7e; D(x, Mx;) 2 —D(@, e) +2 > €; DQ, [kx;) — > e; D(ix,, Mx;): 


ifj i=l i=l 
(6.4.3) 
Since 4, and jy, are non-negative and spherically symmetric around the centers 
x; and x ;, we know from Newton’s Theorem (Theorem 5.2) that 
1 
| on RT 
"2 |x; — x; 


which proves (6.4.1). | 


D(x; ’ Mx, (6.4.4) 


Remark 6.2. The point of the above lemma is that it estimates a quantity 


- ee (6.4.5) 


fay i yl 
in which correlations are important by another one where the correlations among 
the x; are not important. That is, the right side of (6.4.1) depends only on how 
the points x; are distributed relative to 9, but not relative to each other — as 
opposed to the left side of (6.4.1). 


Lemma 6.1 immediately allows us to get a lower bound for Ep in terms 
of the one-particle densities Q;(x) and Q(x) = ae Q;(x). Choosing 9 = Q, 
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multiplying (6.4.1) by the N-particle density Py(x1,...,xy) and integrating 
we arrive at 


N 
Ip = = DQ.9)+2> f DQ. 1s.) Oila bu 


i=1 R3 
N 
= Se f Dies, 4s )QiGax: (6.4.6) 
i=1 R3 
The normalization i Py(x1,...,Xy)dx,---dxy = 1 has been used here. If we 


denote by 46,, the Dirac measure at the point x; we can write, in a somewhat 
formal but illuminating fashion, 

N 
> i D(Q, b;;)Qi(xi)dx; = D(Q, Q). (6.4.7) 
i=l ps 
Hence, by adding and subtracting we get from (6.4.6) 


ee ee (6.4.8) 
where 
N 
F=29> f 000.55, = wy) Ote du (6.4.9) 
i=l ps 
and 
N 
m= ye if D( tty,» be, QiCe)dx;. (6.4.10) 
i=1 


Clearly F2 is positive. Observe that F; is also positive (by Newton’s Theorem) 
for any choice of the jux,. 


6.5 Proof of Theorem 6.1, a First Bound 


In this section, we shall prove Theorem 6.1, but with a worse bound on the 
constant C than the one stated in the theorem. In the next section we will explain 
how to improve the method in order to obtain the better bound. 

We start with the bound (6.4.8) proved in the previous section. In (6.4.9) and 
(6.4.10) we are still free to choose the functions px,. It is clear that jz, has 
to depend on Q; for otherwise F, would be quadratic in Q; and would not be 
proportional to the 4/3 power of Q;. 
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Let w : R? — R be a non-negative bounded function satisfying 


(a) 2 is spherically symmetric about the origin, 
(b) fos u(y)dy = 1, 
(c) u(y) = Oif|y| > 1. 


As already said, we assume, without loss of generality, that all e; are positive. 
Given yz and some positive number A, we choose the 1x, to be 


bx (¥) = BOX) MAO)’ F(x; — y)). (6.5.1) 


Note that since Q is an integrable function, this is well defined for almost every 
x;. Moreover, it is easily seen that the functions jz,, obtained in this way satisfy 
the assumptions of Lemma 6.1. 

The zx, constructed in this way describes a charge distributed over a length 
scale determined by the value of Q at x;. The factor A could be set equal to one, 
but we introduce it in order to optimize the resulting constant at the end. 

If we denote 


R,(a,r) = ar! — ra* oral), (6.5.2) 


where ¢ is the potential associated with J, i.e., 


My) =f 4. A 
P(|x|) -| dy = [nin {—. I w(y)dy (6.5.3) 
Ix — y| Ix| lyl 
R3 R3 
(see Theorem 5.2), then a simple computation shows that 
Fi= | f ouyR.(OU), = yhaxay. (654) 
R3 xR3 


Moreover, since 


S13 (y, — rc 
Diitsys ts.) = 2° 0¢4)% ff M(AQ(Xi) (Xi eens (ey: 2) aydz 


R?xR? 
= 2Q(x;)'7 D(u, 1») (6.5.5) 


we find that 


i=1 


N 
P= 2D. 0) f O(0)"” (> é 0.0) dx. (6.58) 
R3 
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Incase e; = eforalli = 1,..., N, Fy is already of the desired form. Otherwise, 
we use Holder’s inequality to conclude that 


1/4 3/4 


N 4/3 
Fy < 4D(u, w) / O(x)*dx / (S«00) dx}. (6.5.7) 
R3 


R i=1 


We concentrate on F; and prove first a crude estimate which hopefully clarifies 
what the choice of jz,, in (6.5.1) accomplishes. Since jz is non-negative and 
f u(y)dy = 1, (6.5.3) shows that ¢(r) < r~! which implies that R,(a, r) > 0. 
Further, since u(y) = 0 for |y| > 1, we find again by (6.5.3) that Ry(a,r) = 0 
if Aa'/?r > 1. Hence we have the simple bound 


ar ifdra'3r <1 
Ria,r) < (6.5.8) 
0 otherwise, 
which implies that 
F< /| QO) aray, (6.5.9) 
|x — y| 


AQ(x)!8|x—y|<1 


The restriction upon the integration in (6.5.9) obviously plays an important role. 
To make use of it we resort to the following device. Write Q(x) = ie Xa(x )da 
where x, is the characteristic function of the set {x : Q(x) > a}, ie. 


Xo(x) = ' asin (6.5.10) 


0 otherwise. 


Using this and Fubini’s theorem to change the order of integration, the right 
side of (6.5.9) becomes 


CO CO 
[ox fap // Xa)XOM) gray 
Ix — y| 
0 0 2O(x)'3|x-yl<1 


[oe] 


< [cw fap i ee dxdy. (6.5.11) 
0 0 


Aa!/3\x—y|<1 
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Where a < 6 we bound the integrand from above by 


If 2 EEL Zaxdy = | xolrnty ; |x|! 


dol! |x — ‘aa R |x| sQal/3)7 


20 
= jaya | XeQdy (6.5.12) 
R3 


and where a > 6 we bound it by 


Xa(y) 20 
Hasse ye |x — hi = 2022/3 | Rees ae 


R3 


Therefore (6.5.11) is bounded above by 


lo) Qa 


= fn fo [row Bay + f dee fap frets Pax 
0 


0) 0 R3 


-S fe Ye J (x)dx 


“* / O(x)*Bax. (6.5.14) 


Returning to (6.4.8) and using (6.5.7) we have that 
670 
Bee oo | O(x)*dx 


1/4 4/3 3/4 


N 
— AD, #) / O(xy"dx / (+0. dx 
R3 i=l 


R3 


(6.5.15) 


Maximizing the right side over A yields the bound 


34/3 ip N 4/3 
Ep > —S yr? (D(w, wy if O(x)*Fdx if (Lea) dx 
R3 RT 


(6.5.16) 


1/2 
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In order to minimize D(,1, i), the optimal choice for ju is 


1 

Wy) = 7 My = 1); (6.5.17) 
1 

in which case D(w, 4) = 1/2. This yields 
1/2 1/2 
343, on N ue 
Ep se p [ omar / 221010) dx 
R3 R VE 
(6.5.18) 


Note that 34/3771/3 /2 = 3.17. 


6.6 An Improved Bound 


One can improve the constant 3.17 in the bound (6.5.18) by replacing inequality 
(6.5.8) by a more sophisticated treatment. Since R,(a, r) in (6.5.2) is continu- 
ously differentiable we have, using the fundamental theorem of calculus, that 


O(n) 
a 

[ R00, Ir — ybde = f dx / (<8) (a, |x —y|)da = (6.6.1) 

R3 R: 0 


and, again using the definition of x, in (6.5.10), this can be written as 


[ox / Xa(x) (=-8:) (a, |x — y|)da. (6.6.2) 
R3 0 


By inspection of (6.5.2) and (6.5.3), one easily sees that oR 


and hence we can write 


(a,r) is bounded 


Fi = // O(y)R,( Q(x), |x — y|)dxdy 


R?xR3 

love) loo) DR; 
= | da | dB XalX)Xa(W— (oe, |x — y|)dxdy. (6.6.3) 
0 0 R?xR3 


OR, 
da 


An upper bound is obtained by replacing 


denote as eel = max{ oR , O}. 


by its positive part, which we 
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The right side of (6.5.11) can be recovered from (6.6.3) by replacing 


Pe le (a, r) by the function r-'@(1 — A!?ar), where © is the Heaviside step 


function 
0 ift <0 
O(t) = 6.6.4 
#) i ift > 0. : 
The integral hee @(1 — A! a@|x|)|x|7!dx in (6.5.12) is now replaced by 
OR 
/ Ea (a, |x|)dx. (6.6.5) 
da |, 


R3 
To compute this integral observe that 
Ry(a,r) = A04PGQa'?r), (6.6.6) 


where G(t) = 4 — d(t) for t > 0, with @(¢) given in (6.5.3). Hence 
OR OR 
—(a,r) = Aa! ( —* | 1, Aa!r) (6.6.7) 
da da 

and, therefore, (6.6.5) becomes 


(aol)? / | (1, |x|)dx = Qa!?)?K (yu), (6.6.8) 
R3 * 


where K (jz) depends only on jz. Following through all the steps in the previous 
section with the corresponding replacements we end up with the following 
estimate (compare with (6.5.16)) 


4 4/3 1/2 1/2 
Ep > —S16K WD, WI ( i (3-00) ) ( / as 
R3 


5a Mel 
(6.6.9) 


In the previous section, we chose jz to be a constant surface charge distribution 
on the unit sphere which, by making a crude estimate on K (jx), yielded the 
constant 3.17. Computing directly from formula (6.6.8) yields the constant 1.81. 

By choosing jz more cleverly this constant can be improved. We choose jz 
to be the uniform distribution of a unit charge smeared out in a ball instead 
of a sphere, i.e., u(y) = Mo(y) := (3/47) 0d — |y|). A computation of @(r) in 
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(6.5.3) yields 


3(1-4r’) if0<r<l 
d(r) = i : ; (6.6.10) 
3 ifr > 1, 
and hence 
Oe ar~! — 3ra4/3 (1 — $(Aa!/3r)?) — a7r20e?) 
0 tr Ga), 
(6.6.11) 
Therefore 
aR =) =r—77), O<7 <1, 
2 he ki SESE) Vers (6.6.12) 
da 0 1l<r. 
This function is non-negative for 0 <r < wl and so 
59 5 
K (uo) = 4x | — — —V5 ) = 0.6489. 6.6.13 
(Mo) = 4x & D ) ( ) 
An elementary calculation shows that 
3 
D((0, Lo) = 5 (6.6.14) 
and hence 
C < 1.68, (6.6.15) 


which proves Theorem 6.1. | 


CHAPTER 7 


Stability of Non-Relativistic Matter 


With the necessary preliminaries in place we can now turn to proofs of stability 
of matter. The simplest model of matter for which one wants to prove stability 
of the second kind is the conventional non-relativistic Hamiltonian dating back 
to the earliest days of quantum mechanics. It is also the first example for which 
stability was proved; this was done by Dyson and Lenard in 1967 [44]. With the 
aid of the inequalities proved in the previous chapters, the proof turns out to be 
rather short. 

One consequence of stability of the second kind is that matter is extensive, and 
this connection will be explained in Section 7.5. Alternative proofs of stability 
will also be briefly discussed. This chapter will conclude with a demonstration 
that stability of the second kind does not hold for bosons. 

In units described in detail in Section 2.1.7, the Hamiltonian under consider- 
ation is 

N 
H=~)*(-iV;+VaA(x))) +aVc(X, R), (7.0.1) 


j=l 


Nile 


where a > 0 is the fine structure constant, and where Vc(X, R) is the total 
Coulomb potential energy, defined in (2.1.21), 


1 eee Z; ZZ] 
Vc(X, R) = 3 moe Lie + Ps |R, — R)| 


l<i<j<N i l<k<l<M 


(7.0.2) 


Here, N is the number of electrons, and there are M nuclei with charges Z; > 0 
located at distinct points R;. The magnetic vector potential A is an arbitrary 
function in ie OR) with values in R?. 

If the interaction of the spin of the particles with the magnetic field is taken 


into account, there is a contribution ./aB -o to the energy, as explained in 
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Section 2.1.6. In this case, stability still holds, but only if one takes the energy of 
the magnetic field into account. Even then, it will be necessary to have a bound 
on Za? and a. The proof of stability is quite a bit harder in this case, and we 
defer its discussion until Chapter 9. 


7.1 Proof of Stability of Matter 


Theorem 7.1 (Stability of Non-Relativistic Matter). Let Z = maxj;{Z;}. For 
all normalized, antisymmetric wave functions W with q spin states, 


(Ww, Hw) = —0.231 «2Nq?/ (1 + 2.16 Z(M/N)"3)’. i} 


We note that since N!/3M?/3 < N + M, this yields the desired linear lower 
bound. The importance of the linear dependence on the total number of particles 
was explained in the Prologue and in Section 3.2. 

The bound (7.1.1) is not optimal, in the sense that for N >> M the lower 
bound should depend only on M, whereas for N « M it should only depend on 
N. We note that a bound depending only on N can be deduced from relativistic 
stability discussed in Chapter 8, as explained in Remark 8.6. Moreover, a bound 
that depends only on M can be deduced from Theorem 12.1 in Chapter 12, 
where it is shown that the number of electrons that can be bound is at most 
ea 4 + 1) < 2Z+ 1)M, that is, the ground state energy of H for N > 
(2Z + 1)M is equal to the ground state energy with N equal to the largest integer 
<(2Z+1)M. 

In the case of neutral hydrogen, Z = 1, M = N and gq = 2, the bound (7.1.1) 
yields} 7.29 Rydbergs |per nucleus as a lower bound. (Recall from Section 2.1.7 


that in our units an energy of a? is 2 Rydbergs.) In case the optimal constant in 
the kinetic energy inequality (4.2.10) equals the conjectured semiclassical value 


(3/5)(677)?/3, the bound could be improved to} 4.90 Rydbergs | for hydrogen. 


Proof. We start by noting that without loss of generality, we may assume that 
all the Z; equal a common value Z. This follows from the monotonicity of the 
ground state energy in the nuclear charges, which was discussed in Section 3.2.3. 
Thus we can assume that Z; = Z henceforth. 
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According to Corollary 4.1 in Chapter 4, we have 


1 K 
5h ECV) + aarp? y | = 547? : oy(x)Fdx, (7.1.2) 
j ne 
with K > (9/5)(477)!/3 ~ 3.065. Here, Qy denotes the particle density, defined 
in (3.1.4). We point out that for this bound to hold, the antisymmetric nature of the 
wave functions is essential. It allows us to bound the spin-summed one-particle 
density matrix of y from above by gq times the identity. 
To obtain a lower bound on the Coulomb energy, we first employ the exchange 
estimate of Theorem 6.1, which states that 


VW, 2 ea > Dey, ev) — 1.68 f oy(xy"dx, (7.1.3) 
igo! R 


with D denoting the direct electrostatic energy (5.1.3). The electron—nuclear 
interaction is 


N M 
(E> ea Ril v= 3 re wR eRe: (7.1.4) 


so the total Coulomb energy is bounded below as 


“ Z 
(W, Vc W) = Dy, Qy) — ae nero 
k=1p3 K 


+U(R) — 1.68 f oye)" ax, (7.1.5) 
R3 
(Recall the definition of the nuclear repulsion energy UCR) in (2.1.24).) 
A lower bound to the ground state energy of H can be obtained by combining 
(7.1.2) and (7.1.5) and minimizing 


K 
P= 37" o(x)*/3dx + wD(o, 0) + aU(R) 


R3 


- y i ea Rae = 1.680 / o(x)3dx (7.1.6) 


R3 
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under the condition that hes o(x)dx = N. This is the Thomas—Fermi—Dirac 
problem, which is extensively studied in [113]. The simpler Thomas—Fermi 
functional (without the last term in (7.1.6)) will appear again in Section 7.3. 

Instead of pursuing that route, it is simpler (for the purpose of our theorem) 
to use the electrostatic inequality of Theorem 5.3, which implies that 


Z 
Dioy, w- 2 [er pen id + UCR) = -| Seve. 
R3 


(7.1.7) 


where (x) = min; |x — R;| denotes the distance of x to the nearest nucleus. 
The last positive term in (5.2.6) has been dropped for a lower bound, since it is 
not needed for the following argument. In brief we are using the bound 

(w, Vo w) > —1.68 / oy (x)*P7dx — ls ——.0y(x)dx. (7.1.8) 


R R 


z 
D(x) 


The { oy > term can be bounded via the Cauchy-Schwarz inequality as 


1/2 1/2 
/ Oy(x)*7dx < ; Oy(x) dx i Oy, (x)dx 
R3 R3 R3 
a N 
< = | evan*ax ie (7.1.9) 


R3 


for arbitrary a > 0. With the choice a = Kq~7/¢/(1.68a) for some 0 < ¢ < 1, 
this implies the lower bound 


22/3 
(v, HW) = ~(0.84 in sa = oy (x)53dx 
1 
a | Sevendx. (7.1.10) 
R3 


We pick b > 0 and write D(x)! as (D(x)! — b) +b. The contribution of 
the last term to the integral in (7.1.10) is simply DN. Moreover, by minimizing 
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over all non-negative functions @,,(x), we find that 


K 1 
37 _ 0) f ov) ax = za | (= _ ») Oy (x)dx 
R3 R? 


2q 6 ee ais 1 ve 
> res (sq=5) (Za) / Ea -0] dx. (7.1.11) 


Recall that [ - ], denotes the positive part. Since 


I 5/2 I 5/2 M 1 5/2 
=$| = ey pe 
Eo 1zkeM E Ril =), E Ril 


le + k=1 


we can bound 


l 5/2 1 5/2 Cee 
/ Ea — o| dx <M 7 (= — ») dx = Mb-?, (7.1.12) 
D(x) + |x| 4 


R3 |x|<1/b 


After choosing the optimal value of b, we thus obtain the lower bound 


Ae2g2/3N 18 5. 2/3 
(v, HW) = — (0.84) <2 — _ NB Y?23(Z0)2 (: “) . 
é 


5K(—«)\ 4 
(7.1.13) 
Finally, after optimizing over ¢, we have 
q2/3.q2 2 
(W, HW) > — a N (0.84 + Z¥/D(M/N)'/*) . (7.1.14) 
where D = (9/10)m4/32-1/3 = 3.287. As stated in Corollary 4.1, K > 
(9/5)(4r7)!/3 = 3.065. This yields (7.1.1). | 


7.2 An Alternative Proof of Stability 


The proof of stability of non-relativistic matter given in the previous subsection 
relied on three essential ingredients. One is the LT inequality, which relates the 
kinetic energy of fermions to their density 0,,(x). The second is the estimate on 
the indirect (or exchange) part of the Coulomb energy, which gives a lower bound 
on the electron interaction in terms of the classical electrostatic energy of the 
electron charge distribution. Finally, the electrostatic inequality of Section 5.2 
relates the total classical electrostatic energy to the interaction energy of the 
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electrons to the nearest nucleus only. In the end, one has to deal with the @>/? 
semiclassical energy, the negative o*/* exchange estimate, and the negative 
nearest neighbor electron—nucleus interaction. 

One can eliminate the use of the exchange estimate in Section 7.1 by replacing 
Theorem 5.3 by its corollary, Theorem 5.4. This is the most direct route to the 
proof of stability, but it obscures the role of the non-classical exchange (or 
correlation) energy, and it leads to slightly worse constants. The idea for this 
route to stability is due to Solovej [173]. 

The electrostatic inequality of Theorem 5.4 implies that 


N 


Vc(X, R) > -—2Z +1) 
> D(x;) 


where (x) = min, |x — Rx|, as before. We have dropped the positive last 
term in (5.3.2), since it will not be necessary in the following. We shall write 
again D(x)~! = (D(x)! — b) + b for b > 0. The LT inequality in Theorem 4. | 
implies that for antisymmetric functions with qg spin states, 


- ae 2Z+1 : b 
2. EB pees » (Se - )] 


> —qly32°? a? (QZ + ey 


R3 


5/2 


The additional factor 2°/* results from the factor 1/2 in front of the 
Laplacian, which is absent in (4.1.11). As explained in Section 4.1.2, 


I143< a Lys ~ 0.0123. After using the bound (7.1.12) on the last integral we 


obtain the lower bound 
5 2 
H> —ghi32°? QZ + 1)905/2Mb-"? — (2Z + la Nb. 
After inserting the optimal value of 5, this yields 


3 
H > 551 L137 q?? (QZ + oP MONI" 


> —1.073 977 [(22Z + Da? M73N', (722) 


7.3 Stability of Matter via Thomas—Fermi Theory 127 


In the case of neutral hydrogen, Z = 1, M = N and q = 2, this bound yields 


30.52 Rydbergs | per nucleus as a lower bound, which is about a factor 4 worse 


than our previous bound. 

We remark that the method for proving stability of matter given above can 
be extended to yield a lower bound on H that depends only on N and not on 
M. In order to do this, one must not drop the positive nearest-nucleus repul- 
sion in Theorem 5.4, which we could afford to do above. This was shown in 
[83, Thm. 3 of Part H]. Alternatively, one can use the stability of relativis- 
tic matter to obtain such a bound, as explained in Remark 8.6 in the next 
chapter. 


7.3 Stability of Matter via Thomas—Fermi Theory 


The electrostatic inequality of Theorem 5.3, together with the exchange esti- 
mate of Theorem 6.1, allowed us to bound the total Coulomb potential energy 
Vc(X, R) from below by another kind of potential energy, which is just the sum 
of one-particle terms corresponding to the attraction to the nearest nucleus. For 
this latter potential energy it is then clear that binding between atoms does not 
occur, and hence stability of the second kind reduces to stability of the first kind. 
That is to say, in such a model it is clear that the lowest energy occurs when 
the Voronoi cells are as large as possible, which occurs when the nuclei are 
infinitely far apart. This reasoning is one of the key ingredients in the proofs of 
stability of matter given in the previous two sections, the other being the kinetic 
energy inequality in Corollary 4.1. In real life, atoms do bind together, of course, 
but there is nothing wrong with a lower bound on the total energy obtained by 
estimating the energy from below by a model that does not lead to binding. 

There is another way of bounding the total energy from below by an expression 
that prohibits binding. This is the original route taken by Lieb and Thirring in 
[134] and uses the fact that binding between atoms does not occur in Thomas— 
Fermi theory. Since we have already given a complete proof of stability of 
non-relativistic matter in the previous section, we shall only give a brief account 
of the proof via Thomas—Fermi theory here. Our exposition will not be complete, 
since we refrain from proving the no-binding theorem. For details, we refer the 
interested reader to [130, 108, 113]. 

The Thomas—Fermi functional has, as its motivation, a semiclassical approxi- 
mation to the energy of quantum-mechanical system with particle density o(x). 
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It is given by 


TE.) — > mal er ae [ieee (x) 
En) = a5 o(x)Pdx 2 RI” 


2002) M ZiZjot 
7.3.1 
+S pee ee Da 7 IR; — Ril" — 


ae 


The first term, with y = (677)’/?, is the semiclassical approximation to the 
kinetic energy, as explained in Chapter 4. It will be convenient not to fix y but 
rather keep it as an adjustable parameter, as will become clear below. We will 
always assume that y > 0, however. 


If N denotes the number of electrons, the ground state energy in Thomas— 


Fermi theory is given by 


EN, Z, R, y) = inf{ E™ (0): 9 € L°7(R°), o(x) = 0, 7 o(x)dx = N 


R3 
(732) 


It has several remarkable properties. 


E™N,Z,R,y) = E™©), Zi, Z, R, y) for all N > >°, Z;. That is, the 
Thomas—Fermi energy can not be decreased by adding more electrons than 
the total nuclear charge of the system. In fact, one can show that a minimizer 
in (7.3.2) only exists for N < 0, Z;, not for larger N. The excess charge 
simply moves to infinity, and the system stays neutral. 

ETN, Z, R, y) > min pa E™ (Ni, Z;, Ri, vy): Ni; = 9, eae N; = nj. 
This is the no-binding theorem, since it says that moving the nuclei infinitely 
far apart will lower the energy. Note that the Thomas—Fermi energy for a 
single atom, E TF(N, Z, R, y), is independent of the nuclear position R. 
EM(N, 7,0; y= ZIBy ETN /Z, 1,0, 1). This simple scaling property 
follows easily by scaling (7.3.1) appropriately. 


In combination, these properties imply the lower bound 


M 
E™(N,Z,R,y) = y"E™ (11,0, > 27" = y TE™G, 1,0, 127M, 


i=1 


(7.3.3) 
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which is stability of the second kind. A numerical computation shows that 
E™(1, 1,0, 1) = —7.356 a7. 

The importance of the Thomas—Fermi functional for the problem of stability 
of matter comes from the observation in [134] that the Thomas—Fermi energy 
(for an appropriate value of y) is a lower bound to the quantum-mechanical 
ground state energy. In fact, we already have all the necessary inequalities at our 
disposal to show this. 

The kinetic energy inequality (4.2.10) implies that the first term in the 
Thomas—Fermi functional, with y = 6(2r7/2)!/3, is a lower bound to the kinetic 
energy of N electrons. The Lieb—Oxford inequality! (6.2.14), in combination 
with a simple Cauchy—Schwarz estimate as in (7.1.9), then implies that 


N 2 

(w, Hr) > E™(N, Z, R, 6(02/2)"/3 — 5q?/31.68a/3) — 1.68—5— (7.3.4) 
a 

for any a > 0. By using (7.3.3) and optimizing over a one obtains the bound 


2 


M 
1 
(Ww, Hw) > —0.231q73a?N | 141.77 ee” (7.3.5) 
j=l 


For the hydrogen gas, (7.3.5) implies a bound of | 5.60 Rydbergs | per atom, 


which is slightly better than the one obtained in Theorem 7.1. 


7.4 Other Routes to a Proof of Stability 


In the preceding three subsections, we have given several proofs of stability of 
non-relativistic matter with electrostatic forces. Historically, the first proof of 
this fact was given by Dyson and Lenard in 1967. Subsequently, many other 
proofs were given. None of these proofs, however, yields a bound on the optimal 
constant (in the neutral case) that is as good as the one obtained in the previous 
subsection using Thomas—Fermi theory and the Lieb—Oxford inequality. We 
give a brief summary of some of the early work here. 


' In the original proof of stability of matter by Lieb and Thirring, the Lieb—Oxford bound was 


not available. Instead, a weaker bound on the exchange energy of the form (f @°/*)'/7([ @)!/? 


4/3 


in place of [ 9*/? was used. 
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7.4.1 Dyson—Lenard, 1967 


The first proof of stability of non-relativistic matter was given by Dyson and 
Lenard in 1967 [44]. This proof was one of the most complicated proofs ever 
to appear in mathematical physics. For many years, it was regarded with awe, 
like Onsager’s solution of the two-dimensional Ising model in 1944 [146]. The 
key question that had to be addressed was how to bring in the Pauli exclusion 
principle. In the proof by Lieb and Thirring, it is this principle that lies behind 
the f o°/> bound on the kinetic energy. This is a global result. In the Dyson— 
Lenard proof, in contrast, the Pauli principle is used locally to mandate that 
for N > 2 (spinless) fermions in a ball of radius £, the kinetic energy (with 
Neumann boundary conditions) is at least (NV — 1)/€, since at most one such 
fermion can be in the zero energy state. 

The lower bound obtained in [44] yields about 10'4 Ry per particle for a 
hydrogen gas. This is, of course, an unrealistically large number and resulted 
from the many different inequalities that had to be used. 


7.4.2  Federbush, 1975 


Federbush [55] realized that the methods of constructive quantum field theory 
could be utilized to solve the stability problem. Although his paper is not very 
transparent, it is much shorter than the original proof by Dyson and Lenard. An 
estimate on the constant appearing in the inequality is not given, however. 


7.4.3 Some Later Work 


Other approaches for proving stability of non-relativistic matter were developed 
by Fefferman [56] and Graf [79], although without estimates of the constant 
in the inequality. In [56] results on extensivity of matter and properties of the 
ground state density are also obtained by dividing space suitably into boxes. In 
[79] the necessary electrostatic inequality is obtained by decomposing space into 
a lattice of disjoint simplices and, by averaging over translations and rotations of 
this lattice, a lower bound on the Coulomb energy is obtained by restricting the 
Coulomb interaction to pairs of particles belonging to the same simplex. The use 
of averaging to get a lower bound (but with cubes and only with the translational 
and not the rotational averaging) goes back to [34]. Finally, the Pauli principle is 
used locally in every simplex in the form of a lower bound on the kinetic energy 
in terms of the 5/3 power of the particle number in the simplex. 
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7.5 Extensivity of Matter 


Suppose we have anon-relativistic Coulomb system composed of several species 
of particles, of which the negative ones are fermions. The nuclei can be static 
or dynamic. By Theorem 7.1 and Corollary 4.1 there are constants A > 0 and 
K > Osuch that 


E(W)>—-AN and Ty>K / oy! (x)dx (7.5.1) 
R3 
for any normalized y. In fact, K > (9/10)(27/q)*/3. Here E(y) is the energy 
functional (w, Hy), Ty, is the kinetic energy of the negative particles, and oy, is 
the density of the negative particles (see Eq. (3.1.4) in Chapter 3). There are N 
electrons and M nuclei. The constant A depends on Z and the ratio M/N. 
The following theorem shows that any wave function w with bounded energy 
per particle is necessarily spatially extensive, in the sense that the particles 
occupy a volume proportional to NV. 


Theorem 7.2 (Extensivity of Matter). Let y be any normalized function with 
energy E = E(w), and let A and K be constants such that (7.5.1) holds. 


(i) For each p > 0 there is a universal constant y, such that 
1/p 
1 -1 
— / Oy(x)ix|’dx } > ypK'/? (E/N +A+ VA) nv. 


N 
R3 


(7.5.2) 
Thus, the average distance to the origin, 0 € R?, among all the particles 
is at least as great as N\/3 
can make the same statement relative to any point y € R°. The ‘volume of 


. Since the choice of the origin is arbitrary, we 


matter’ is, therefore, proportional to the number of particles. 
(ii) If Q is any measurable set in R? 


1 3/5 6/5 (vol(Q)\*/° 
W / Oy(x)dx < K (VEIN +A+ VA) ( ) . (75.3) 
Q 


Thus, if contains a non-vanishing fraction of the particles then vol(Q2) 
has to be bounded below by a constant times N. 


Remark 7.1. A special case of (7.5.2) was given in [135]. The general case 
appeared in [108]. Inequality (7.5.3) is due to Thirring [178, Sect. 4.3.5]. 
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Proof. (i) First we derive an upper bound on the kinetic energy Ty,. Write 
BS e) Ale =A + Va (7.5.4) 


with 0 <2 <1, and note that the last two terms are bounded below by 
—AN/(1 — A). This follows from the stability bound and scaling, i.e., the fact 
that the kinetic energy has the dimension of the square of an inverse length and 
the potential energy has the dimension of an inverse length. Thus we have 


E AN E+AN AN 
Ty <—+ = + ; 
i i t=) 


(7.5.5) 


Note that the right side of (7.5.5) tends to +00 as 4 approaches 0 or 1 since 
E+ AN > 0. Minimization of (7.5.5) with respect to A yields the bound 


2 
ce (VE TAN + VAN) (7.5.6) 
With the second inequality in (7.5.1) we reach the basic inequality 
2 
K / oylx) dx < Ty < (VE TAN + VAN) (7.5.7) 
R3 


Next, we assert the following general fact [108, p. 563] about functions 9 
from R? to R*: 


p/2 14+5p/6 


[ oar [ Proce ax a OF [ ecoar » (7.5.8) 
R3 


R3 R3 


for all p > 0. The sharp constant, C,, is obtained by inserting o(x) = (1 — 
|x|?)?/? for |x| < 1 and O(x) = 0 for |x| > 1 into (7.5.8). This is a simple 
exercise in the calculus of variations which we shall leave to the reader. 

By combining (7.5.7) and (7.5.8), together with i) Oy = N, we obtain (7.5.2) 
with y, = C)/”. 

(ii) By HOlder’s inequality, {Joy < (Uo D7°Uo oy < vol(Q)?/> 
( Ts oy 3)3/ >. Inequality (7.5.3) is now an immediate consequence of 
(7.5.7). | 
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7.6 Instability for Bosons 
7.6.1 The N*/> Law 


If one allows the particles to have g = N different spin states, then the bound 
(7.1.1) holds regardless of the statistics of the wave functions. This was discussed 
in Chapter 3, Subsection 3.1.3.1. In particular, this shows that for bosons a lower 
bound of the form —C N°’? holds. This was already pointed out by Dyson and 
Lenard [44] and an alternative proof was later given by Brydges and Federbush 
[24]. The power 5/3 can be proved to be optimal in the case of infinitely massive 
nuclei [110], and we shall show this here. That is, in the bosonic case also an 
upper bound of this form holds. In particular, non-relativistic matter made out 
of bosons is stable of the first kind, but unstable of the second kind. This is the 
primary example of the distinction between the two kinds of stability. 


Proposition 7.1 (N°/? Instability for Bosons). Let Z; = Z for 1 <i < M. 
There exists a normalized symmetric wave function w of N boson coordinates, 
and positions R = (R,..., Ry) of M nuclei in R>, such that 


EQ) = (W, Hy) < —Ca?Z*3 min{N, ZMP? (7.6.1) 
for some constant C > 0 independent of all the parameters. 


Proof. For simplicity, we shall assume that M = n> for some integer n, and 
N = ZM. In order to show the desired upper bound, we can use a trial function 
of the form 


N 
W(x1,.-..xv) =] [ds@p. 
i=l 


Here 4 > 0 is a scaling parameter which will be optimized at the end. That is, 
we fix some function g with tt |g(x)|*dx = 1 and write 


d(x) = 07/7 (Ax). 


The energy of our trial function y is easy to calculate. We obtain 


(w, HW) = ne f |\Ve(x)|?dx + AWW(N, R), (7.6.2) 
R3 
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where 


2 2, 
W(N, R) = 5M = » ff sO oxy 


R°?xR3 
2 
28 | or Ol ax + UR). 


The nuclear repulsion U(R) is defined in (2.1.24). We want to show that we 
can find an R such that W(N, R) < —CZ?/3N*? for some constant C > 0. The 
desired upper bound (7.6.1) then follows immediately from (7.6.2) by optimizing 
over i. 

Choose g to be compactly supported, and divide the support of g into M cells 
Ty, 1 < k < M, such that 


1 
[scorer =a for all k. (7.6.3) 


We place one nucleus in each cell I, and average its position with respect to the 
weight M|e(x)|’, restricted to P,. The average value of W(N, R) is then equal to 


Ig@)P71g(y)I? 


1 1 
—N(N —1)—ZNM + =Z?M? / dxdy 
2 2 |x — y| 

R3 xR? 

- 


ATs T%, 


which implies that W(N, R) is less or equal to this value for some choice of R 
(because an average is never less than the minimum). 

Under the assumption N = ZM, the first term in (7.6.4) is negative and can 
be dropped for an upper bound. We are left with the problem of finding a lower 
bound on the self-energy terms 


! " Ig)P71g(y)I? 
2 


Ix — y| 


dxdy. (7.6.5) 
TyxT, 

Let r,; denote the radius of the smallest ball containing [’,. Then (7.6.5) is 

certainly greater than the smallest possible self-energy of a charge distribution 

of total charge 1/M confined to a ball of radius r;. It is well known that the 


smallest self-energy occurs when the charge is uniformly distributed over 
the boundary of the ball, and the minimal value equals (1/M)*/(2r;) [118, 
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Section 11.15]. In particular, 


1 2 2 
/ le(x)Ile(y)| din 
2, Ix — y| 2M2r, 


Ixy 


Using Jensen’s inequality,” M~! )°, r,' > M(>°, r,)7!. We thus have 
|g(x)| sent 1 1 
ge ff ayy a = 3M 


krexTy 


It remains to estimate ()°, r,)/M, the mean value of the radius of the smallest 
ball containing T’,. Note that we are free to choose the decomposition of the 
support of g into the cells I, subject to the normalization constraint (7.6.3). It is 
fairly obvious that for sufficiently regular g one can choose the decomposition 
such that ()°,r,)/M < CM~" for some constant C > 0, which proves the 
desired bound. In fact, in [110] the following explicit construction is given. 

For x = (x!, x7, x3) € R3, let g(x) = f(x!) f(x?) f(x?) for some function f 
supported in the interval [—1, 1]. Under the assumption that M = n? for some 
integer n, we can first decompose [—1, 1] into n intervals J; with the property 
that Si, | f(t)|-dt = 1/n for all 1 < j <n, and take the ri to be products of 
intervals. If I, is a rectangle with side lengths s, ¢ and u, the radius of the 
smallest ball containing I’, is r, = (st +17 + uy 2 <(s+t+u)/2. Note 
that the average value of s, ¢ and u for all the Py, equals 1/n = M~'/?. Hence 


M m=1 i 2 


in this case. This is precisely of the desired form. | 


7.6.2. The N’” Law 


The N°’? instability is mitigated somewhat if the finite nuclear masses are taken 
into account. The system is still unstable, but the energy goes as —-CN7/ in 
this case. It was Dyson in 1967 [43] who proved that the energy was at least as 
negative —CN’/> for charged bosons, but it wasn’t until 1988 that a rigorous 
lower bound of this form was demonstrated by Conlon, Lieb and Yau [34]. 
Finally, in 2004, the correct value of the constant C for large N, as conjectured 
by Dyson, was proved as a lower bound by Lieb and Solovej [132], and as an 


> If f is a convex function and yw is a positive measure on R then f f(x)u(dx)/p(R) = 
fCf x w(dx)/u(R)). See [118, Sect. 2.2]. 
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upper bound by Solovej in 2006 [172]. All these works consider the special 
case of positive and negative charges of equal magnitude, and equal masses 
of the particles. In principle, the analysis applies to a more general choice of 
charges and masses, however. For simplicity, we will only discuss the simpler 
case of equal masses and charges here. The Hamiltonian of the system under 
consideration is given by 


1 y €;e; 
H=--)Y A; oa 7.6.6 
- d +» ool (7.6.6) 


l<i<j<N 


where e; is either +1 or —1. Note that here N denotes the total number of 
particles, both positive and negative. It follows from Theorem 3.3 that the infi- 
mum of (Ww, Hy) over all wave functions w is the same as the infimum over 
wave functions that are symmetric in the coordinates of the positive and of the 
negative particles. Hence Bose symmetry is not really relevant here and one 
might as well forget all symmetry requirements. The ground state energy of H 
is defined by minimizing not only over all wave functions y but also over the 
values of e; = +1. 

If one accepts the N’’/> law then, by scaling, the system contracts to a diameter 
N~'/> because the kinetic energy has to be Nx diameter~*. What is not so 
obvious is that the potential energy can be made as negative as —N7/°. The 
Coulomb energy of the average charge density, which is zero, vanishes and, 
therefore, the Coulomb energy is entirely exchange-correlation energy in this 
case. Thus, in order to achieve this very low energy, it is necessary to employ a 
sophisticated wave function, namely the one suggested by Bogoliubov in 1947 
[21].° In the end, one would expect that the one-particle density of the particles 
is given by some mean field equation. In fact, the density will be asymptotically 
equal to the @ that minimizes 


; i vee] ax ~ Ina / o(x) dx, (7.6.7) 


R3 R3 


3 A decade later Bardeen, Cooper and Schrieffer [7] used the fermionic analogue of this wave 
function to explain superconductivity. 
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under the condition that = o(x)dx = N. Here, Jp is Foldy’s constant [68] 


Co 
ae 23/27(3/4) 
h=(= (1+ -PVi442) de = = © 0.5744, (7.6.8 
(2) f(r Pe) OSTA) rs 
0 
which we will explain below. Note that o is the total density of N/2 positively 
charged and N/2 negatively charged particles. 
If one defines O(x) = N~*°Q(N7'/°x)'/?, then fy, O(x)?dx = 1, and ® 
satisfies a Lane-—Emden equation 


5 
—Ad(x) = 5 foal" &(x)/? + wd(x) = 0 


for some jz > 0. It is possible to prove that the minimizer of (7.6.7) is unique 
up to translations [13, 100, 141, 189]. 

The proof that this is the correct large N asymptotics is very complicated. 
Even the imprecise bounds found in [43] and [34] are very complicated, and 
we will not attempt to reproduce them here. There are no simple proofs in this 
subject, comparable to the stability of matter proofs for fermions presented in 
the beginning of this chapter. 

The constant Jo in (7.6.8) was derived by Foldy in 1961 [68]. He considered 
a different problem in which there is only one species of bosons with negative 
charge, free to move in a background charge of uniform positive density 9. This 
is also known as the jellium model; it will be discussed further in Section 14.7. 
By astraightforward application of Bogoliubov’s 1947 procedure (using creation 
and annihilation operators on Fock space), Foldy arrived at the formula 


Eo 5/4 


at x — Iya’! ols (7.6.9) 


which he conjectured to be asymptotically correct for large 9. A lower bound of 
this form, but with a different constant, was also proved in [34], and the correct 
lower bound (7.6.9) for large @ was proved in [131]. The corresponding upper 
bound was derived in [172]. There is apparently no simple way to understand 
the exponent 1/4 in (7.6.9). It is subtle quantum-mechanical correlation effect 
and cannot be understood in classical terms. 

The connection between the two problems was recognized by Dyson in his 
paper in 1967 [43]. His considerations led to the energy minimization problem 
in (7.6.7). We can make this plausible in the following way. The kinetic energy 


-1/3 


consists of two parts. One is very local on a length scale of order o and is 
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responsible for the correlations among the particles. Second, there is a global 
kinetic energy, which is not accounted for in the Foldy energy, which is the energy 
of a homogeneous system. This extra envelope energy is simply (1/2) [ |V./@ le 
The potential energy is slightly more subtle. According to (7.6.9) the energy per 
unit volume is —Jga°/*0>/* and hence one might expect that the second term 
in the energy functional (7.6.7) should be 2Ipa>/* S(ex)/2)°4dx. This is the 
answer one would get in a mean-field picture in which the positively charged 
particles form a uniform background for the negatively charged particles, and 
vice versa. The truth is that at high density, the correlation energy can actually 
be understood perturbatively. In the jellium case, the particles stay out of each 
other’s way, and this gives rise to the Coulomb correlation energy. In the two- 
component case, like charges will also stay out of each other’s way but, in 
addition, they will prefer being close to the opposite charge. Both give a negative 
Coulomb correlation energy. The point about the perturbation theoretic argument 
is that the correlation energy for both these effects is the same, and both give 
a negative contribution to the energy. If one accepts this argument, the total 
correlation energy should contain the total @, and not 9/2, as given by the 
second term in (7.6.7). 

Poetry aside, it is rigorously established in [132, 172] that the energy for large 
N is given by minimizing (7.6.7). 

One last remark. It would seem that the jellium problem should be easier than 
the two component case. This is indeed so in the rigorous proofs in [130] and 
[132], as well as in Dyson’s original heuristic arguments [43]. In [34], however, 
where the correct exponents in the N dependence where first proved, it was 
found that it was easier to prove the two-component case first, and derive the 
jellium case from it. 


CHAPTER 8 


Stability of Relativistic Matter 


8.1 Introduction 


When electrons are treated relativistically the stability of matter question will 
obviously be more delicate than in the non-relativistic case. The reason is that | p| 
is much less than |p|? when |p| is large. (Recall that |p| — m < /p? +m? — 
m < |p| so, for our purposes, we may as well consider just |p| instead of 
/ p? + m? — m for the kinetic energy of a particle of mass m and momentum p.) 
The premonition that this subject has significant dangers is borne out by the fact 
that the stability of a single hydrogenic atom, which always occurs in the non- 
relativistic case, is found to occur relativistically if and only if Za < 2/z.! 
The implication for the many-body problem is, of course, that even stability 
of the first kind requires that Z;a must be less than or equal to 2/7 for every 
1 < j < M. Butis this enough? The surprising (or not so surprising, depending 
on your point of view) fact is that a bound on aq itself is needed, no matter how 
small the maximal Z; might be. Recall from Section 3.2.1 that stability of the 
first kind and stability of the second kind are equivalent for relativistic matter. 

The fact that a separate bound on @ is needed was unknown in the physics 
literature, and is largely unknown in the physics community even to this day. 
Given that a is below some critical value, is it true that stability will hold all 
the way up to Zja@ = 2/7 for all j, independent of the number of nuclei (and 
independent of a)? The answer is yes, and this was proved (in the absence of 
magnetic fields) in 1988 [138]. This fact remains true in the presence of arbitrary 
magnetic fields (ignoring the o - B term), but it took two decades more to prove 
it [71]. Section 8.7 contains a summary of known results and a short history of 
the problem. 


' A bound Za < 1 is known to be needed for the Dirac theory of the hydrogenic atom, and the 
need for a bound in both cases has the same origin. Cf. Chapter 10. 


139 


140 Stability of Relativistic Matter 


The Hamiltonian under consideration, as described in Chapter 3, is 


N 
H =) °T(p;) + aVc(X, R), (8.1.1) 
i=l 
with Vc given in (2.1.21). The kinetic energy is TJT(p)= 
V(p + Ja A(x))? + m2 — m, including the possibilities that m = 0 or A = 0. 

Before investigating stability of the Hamiltonian (8.1.1), we shall first study 
the relativistic one-body problem in detail in Section 8.2. Sections 8.3 and 8.4 
contain additional bounds which will be useful in the sequel. The proof of 
stability of relativistic matter is given in Section 8.5. The proof given there 
does not yield the best known results, but it has the virtue of being the most 
simple and direct, and is the closest to being a natural generalization of the 
proof of the stability of non-relativistic matter given in Section 7.1 in Chapter 7. 
In particular, we do not give the rather complicated proof of stability up to 
Za = 2/7 in the many-body case. An alternative proof, which will be important 
later in Chapter 11, will be given in Section 8.6. We summarize the best known 
results concerning stability of relativistic matter in Section 8.7. 

Another question, which will be addressed in Section 8.8, is an upper bound 
on the allowed values of a for which stability of relativistic matter holds. We 
shall show that instability of the first kind really does occur if @ is too large; the 
requirement of a bound on @ in Section 8.5 is not just an artifact of our proof 
but actually reflects reality. 


6.1.1 Heuristic Reason for a Bound on a Itself 


As will be shown in the next section, stability requires a bound Za < 2/z for 
each nucleus. This fact, alone, leads to the conclusion that a itself must be 
bounded, as we now discuss heuristically. Indeed, it was shown in [38] that 
one can create an arbitrarily negative energy using only one electron and many 
nuclei if @ is greater than a certain value of the order of unity, no matter how 
small Z might be. 

Imagine that we have M nuclei of charge Z and that Za =e « 1. Let M 
be such that Me ~ 4/z. Thus, if we can bring the M nuclei together we will 
create, effectively, a nucleus with Z'a ~ 4/z and then one electron can have 
an arbitrarily negative energy. (Recall that whatever energy bound we have is 
supposed to be independent of the location of the nuclei.) What might prevent this 
nuclear fusion from happening is the Coulomb repulsion of the nuclei. Suppose 
the nuclei are located in a small ball of radius 5, in which case the nuclear 
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repulsion energy is about C/5 with C = M?Z?a = M?(Za)*/a © (4/2)? a7!. 
This number will be small if a is large. In short, the effective repulsion, which 
supposedly prevents collapse, will be small if a is large. This argument suggests 
that if we regard Za as fixed then an upper bound on q@ is needed because a 
tiny value of Z can be offset by a large number of nuclei. It turns out that this 
conclusion is correct, even if one does not believe the argument. 

For two nuclei, with Za = 2/z for each, this argument can be made rigorous, 
in fact, as was done in [38]. This will be further discussed in Section 8.8 and 
the requirement of an N-independent bound on a, first proved in [138], will be 
proved there. In particular, the dependence of this bound on q, the number of spin 
states of each electron, will be shown to be ~ 1/q'/°. This, in turn, implies that 
there is never stability of the first kind for relativistic bosons because N bosons 
can be thought of as N fermions with g = N, as explained in Section 3.1.3.1. 
(Recall that non-relativistic bosons are always stable of the first kind, but not the 
second.) 


8.2 The Relativistic One-Body Problem 


Before discussing the problem of stability of matter with relativistic kinematics 
we consider the corresponding one-body problem. Since |p| and 1/|x| both 
scale like an inverse length, there is a critical coupling constant above which 
even stability of the first kind fails. 

In the non-relativistic case the relevant function space for one particle was 
H'(R*), namely functions f such that f(x) € L?(R*) and lk| f(k) é17(R°), 
In the relativistic case under consideration here, the relevant space is H 1/2 qR3), 
namely f(x) € L?(R*) and |k|'/2 F(k) € L?(R*). This can be summarized as 
(1 + |k|/2) Fk) € L2(R3). 

We start with the following simple observation. 


Lemma 8.1 (Equivalent Form of Relativistic Kinetic Energy). For 
f € H'/?(R®), we have 


(f PIA = / |20rk|| FU) ak 


Rd 


a (= -) pit | If) — FOP aay 


2. |x — yet 


(8.2.1) 
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Remark 8.1. Part of the lemma is the assertion that the integral on the right is 


well defined for an H'/? function, despite the apparent non-integrable singularity 
eS yi, 


Proof. The following proof is the same as in [118, Sect. 7.12]. For an alternative 
proof, see [72, Lemma 3.1]. We first note that 


1 
Cf [pI f) = lim —(f, 1 — ef), 


which follows by writing out the expectation values as integrals in Fourier space, 
and using dominated convergence to justify the limit t > 0. 

Recall that with our convention for the Fourier transform, the operator | p| = 
/—A acts via multiplication by 27|k| in Fourier space. The inverse Fourier 
transform of g;(k) = e~°"*! on R¢ is given by [118, Sect. 7.11] 


> d+1)\ wasn t 
G(x) =0 (=) 1 aa Epa (8.2.2) 


which can easily be obtained from the formula (4.4.10) in Chapter 4. It can also 
be computed directly by elementary means. Hence 


1 Ze 
(FU ef) = 5 i / Lf) — FO) BO — y)dedy. 
R¢xR¢ 


The lemma follows by first dividing by ¢ and then taking tf — 0. The monotone 
convergence theorem allows us to take the limit t — 0 of the integrand. | 


Remark 8.2. We record, for the reader’s possible interest, the analogous Fourier 


transform of g”(k) = exp(—t,/(2xk)? + m2) in R3 (see [118, Sect. 7.11] for 
the general R¢ case): 


an 2 
(m) = m t 
g (x)= Wn? 4x? Kx(myV x2 + t?), (8.2.3) 


where K> is the modified Bessel function of the third kind. Notice that (8.2.2) 
and (8.2.3) have a nice Euclidean form (after multiplication by t~'), i.e., they 
are functions only of the d + 1 dimensional distance ./t? + x?. 


The representation (8.2.1) is useful for establishing the following inequality, 
which can be interpreted as a relativistic uncertainty principle. 
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Lemma 8.2 (Relativistic Hardy Inequality). Letd > 2, andlet f be anon-zero 
function in H'/*(R¢). Then there is the strict inequality 


r d+ly2 2 
ae a sa 
ace 


(Ff. IPI) > 2 


(8.2.4) 


Moreover, the constant is sharp, i.e., for any bigger constant the inequal- 
ity fails for some function in H'/*(R4). For d = 3 the constant in (8.2.4) is 
2T Oy /1 12) = 2/x, 


Remark 8.3. This inequality goes back to Kato [99] and Herbst [89]. See 
also [184, 12, 188, 72]. The non-relativistic analogue is Hardy’s inequality 
Spa IV @)I?dx > [(d — 2)?/4] faa | f(x)? lx|-7dx for d > 3. 


Proof. For simplicity, we restrict our attention to the case d = 3, which will be 
the relevant case for our considerations on stability of relativistic matter. For 
general d, the proof works the same way, but the integrals involved are slightly 
more complicated. See [89, 188, 72]. 

According to Lemma 8.1, 


1 _ 2 
(fpf) = 5 | FAC, ao dx dy, (8.2.5) 
1 |x — y| 


R3 x R3 


which we can write as 


dx dy (8.2.6) 


i] f(x) — f(y)? 
Me —yl?+e) , 


. 1 
(flpif) = lim = | 
R?x 


R 


The purpose of ¢ is to avoid the singularity at x = y. The limit e — 0 will be 
taken at the end of the calculation. 
We can write 


— el f@) = ll FO? 


If@) — fODP = +1 fe? (1 : =) 
Ix||y| ly| 


+LFOP (1 : 2") . (8.2.7) 


|x| 
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Hence the integral in (8.2.6) can be written as 


IIx] f@) = lvl FOP dx dy I |f@)P ( =) 
2 1- dxdy. 
If eserer itt fd e-PRO 


R3 x R3 
(8.2.8) 


After performing the y angular integration, the second integral equals 


870 i} dx| f(x) |? Ic i= 1x) . (8.2.9) 
(|x|? — 1°)? + 2e(|x|? + 1?) +? 
R3 


The ¢ integral is non-negative and converges, as ¢ — 0, to the principle-value 
integral 


CO 
e—le) yt 
(x22)? ZI] 


(8.2.10) 


It is tedious but elementary to justify exchanging the x integration and the e — 0 
limit. 
We conclude that 


_ Ix f@) — lyf)? dx dy 
iain fife Pie = ma sn ff (x—yP +e? Illy 


(8.2.11) 

Since the right side is positive, this proves (8.2.4). The inequality is strict since 

the right side of (8.2.11) is zero only if f(x) = c/|x|, which is not an Hy 
function for c 4 0. 

Note that the right side of (8.2.11) is finite for a compactly supported function 

f that diverges like 1/|x| for small |x|, while i | f(x)|?|x|~!dx would be infinite 

for such a function. A simple limiting argument thus shows that if 2/7 is replaced 

by a larger number the inequality (8.2.4) cannot hold. The constant 2/7 is, 

therefore, sharp. | 


From Lemma 8.2 we conclude that the energy functional 


(f, |pLf) — Ze | lo at 


for f € H'/?(R%) with fps | f(x)/?dx = 1 is bounded from below (in fact, posi- 
tive) for Za < 2/zr, while it is unbounded for Za > 2/2. The same is true if the 
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mass is taken into consideration, i.e., | p| is replaced by \/ p? + m2 — m, since 


|p| —m < /p?+m?—m <\|p|. (8.2.12) 


Hence, stability of the first kind holds for a relativistic hydrogenic atom if and 
only if Za < 2/zn. 


8.3 A Localized Relativistic Kinetic Energy 


Before we turn our attention to the problem of stability of matter in the relativistic 
case, we state and prove a local version of the inequality (8.2) which will be 
needed in the following. It is taken from [138, Theorem 7]. 


Lemma 8.3 (Localized Relativistic Kinetic Energy). Let B be a ball of radius 
D centered at some w € R°. Then 


fo _ =f 
| yi dxdy 
BxB 
; : (8.3.1) 
se A) ENE ae: [ vox - wispy F00Pex, 
a |x — w| D 
B 
where 
¥(@) = : 18 i ag 1-? inti t uid 1 
Y= in aac eee ee 


(8.3.2) 


Note that Y(t) is non-negative, continuous and bounded for 0 < t < 1. Numer- 
ical evaluation shows that Y(t) < 1.57. 


Proof. Without loss of generality, we can set w= 0 and D= 1. Leth bea 
strictly positive function on [0, 1]. Similarly to (8.2.7), we can write 


fe) — fa) 
h(x) 


2 en) (1- a) 
+ | f(x) (1- Ax) +1 f(y? AyD)” 


f(x) — f(y”)? = AUxDAC YD) | 


146 Stability of Relativistic Matter 


Using positivity of the first term, and performing the angle integration as in the 
proof of Lemma 8.2, we obtain 


= | £2 — f(y)? 
Qn? — y|* 


dxdy 


BxB 


> = f ax ipo? pw Jain <5 (1- wey) (8.3.3) 
as (ime =2 A(\x|)J 0 


(Compare with the proof of Lemma 8.2, where the choice h(t) = 1/t was used.) 
In the following, we will choose h(t) = 1/t +t. This particular choice is 
motivated by the fact that the derivative of h at t = 1 vanishes. It is easy to see 
that the potential on the right side of (8.3.3) is bounded at |x| = 1 only if h has 
this vanishing derivative property. Upon evaluating the integrals, we find that 


1 
af h(t) ilo 
P “| (xe (1 . wen) oS 


which finishes the proof. | 


8.4 A Simple Kinetic Energy Bound 


The following simple inequality gives a lower bound on the many-body kinetic 
energy in terms of the kinetic energy of the square root of the one-particle 
density. This bound has proved to be useful in several contexts. For 6 = 2, 
it was first discovered by M. & T. Hoffman-Ostenhof [92]. It works both in 
the relativistic case and in the non-relativistic case, and uses the positivity 
of the Fourier transform of exp(—t|p|°) for 0 < 6 < 2 [150, Example 2 in 
Section XIII.12].* Since exp(—t,/|p|? + m7?) is also positive definite, it also 
works in the relativistic case with non-zero mass. It applies to any many-particle 
wave function, irrespective of the symmetry type. It is partly also a diamagnetic 


? In [150] this positivity of the Fourier transform of exp(—t| p|°) is stated for 0 < 5 < 2 ford > 2 
while it is only stated for 1 < 6 < 2 in the case d = 1. With the help of Eq. (4.4.10), however, 
the result in [150] can be easily extended to 0 < 5 < 2 ford = 1. For all d, a direct proof of the 
positivity for 5 = 1, the case of interest to us, can be obtained directly from the formula (4.4.10), 
starting with the known positivity for 6 = 2; further use of that formula will yield positivity for 
6 =1/2,1/4,.... 
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inequality, since the magnetic vector potential A appears only on the left side of 
the inequality, not on the right. 


Lemma 8.4 (Kinetic Energy Bound in terms of the Density). For 0 < 6 < 2, 
the bound 


N 
(¥ S- |p + VaA(x:)|” ‘) > (Joy, \Pl’ Jer) (8.4.1) 


i=1 


holds for any N-particle wave-function yy € H®!?(IRN¢) with density Oy (defined 
in (3.1.4), and any magnetic vector potential A. (The Pauli exclusion principle 
plays no role here.) 


Proof. We first consider the case A = 0. In terms of the one-particle density 
matrix y“ of y (defined in Eq. (3.1.27)), Inequality (8.4.1) says in this case 
that 


[cere ff avay er (yx, ») — VeuCe0v(0)) = 0. 
R¢ R¢xR¢ 
(8.4.2) 


Note that the positive definiteness of y!) implies that |y“(x, y)|* < 
yx, x) yy, y) = Ov(X)oy(y), with equality for x = y, of course. Since 
| p|° = lim,_,9(1 — exp(—t| p|°)/t, (8.4.2) follows if we can show that 


faceoff aay 9 (y'%Gx, 9) — Vau(@e¥0)) <0 


R¢ R¢xR¢ 


for all ¢ > 0. But this inequality follows immediately from the fact that 
exp(—t|p|°) has a non-negative Fourier transform for 0 < 5 < 2. 

In order to extend the inequality to the magnetic case A + 0, it is sufficient 
to show that 


/ / gAcx, yyw, y)dxdy < / i 2(x, y)ly(x, y)idxdy 


R? xR3 R3 x R3 


= /| 2(x, y)x/Oy(x)/oy(y)dxdy, (8.4.3) 


R?xR3 
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where g(x, y) denotes the integral kernel of the operator e~' IP+VaA@! This 
inequality follows from the fact that leAcx, y)| < g(x, y) for allx, y € R* and 
t > 0, which is stated in Theorem 4.4. |_| 


8.5 Proof of Relativistic Stability 


In the relativistic case, the proof of stability of matter is more complicated than 
in the non-relativistic case mainly for one reason: although the relativistic kinetic 
energy dominates the Coulomb potential for small enough coupling, as shown in 
Lemma 8.2, the semiclassical expression for the relativistic kinetic energy, which 
was shown in Chapter 4 to equal { Oy(x)*/ dx, can not dominate the Coulomb 
energy, — / 0y(x)|x|~'dx. More precisely, among all non-negative densities 0 
with { o(x)dx < 1, say, the quantity E(0) = f o(x)4/3dx — A f o(x)|x|~'dx is 
not bounded from below, no matter how small A is. If it were bounded, it would 
have to be non-negative, for otherwise €(@) can be made arbitrarily negative by 
rescaling o(x) +> y30(yx), which preserves ‘| o(x)dx but drives E(@) to —co 
as y — oo. But for any given 9, €(@) can be made negative simply by replacing 
o(x) by ee(x) and choosing ¢ sufficiently small. 

Simply mimicking the proof of stability of matter in the non-relativistic case 
will not yield a proof in the relativistic case. The Coulomb singularity can not 
be controlled by the semiclassical kinetic energy. As shown by Lieb, Loss and 
Siedentop in [122] stability can, indeed, be proved in a somewhat similar way, 
however, if part of the kinetic energy is bounded from below as in Lemma 8.4 
above, and the remaining part is estimated via the Lieb—Thirring type bound 
(4.2.11) in terms of the semiclassical expression. We will now explain this 
method in detail, following closely the strategy in [122]. We repeat what was 
already said in the introduction: This method does not yield the best results, but 
it is the simplest route to a proof of relativistic stability. 

For simplicity, we consider here only the case m = 0, i.e., the case of mass- 
less particles. The necessary modifications for m > 0 will be discussed in 
Remark 8.5. We note that in the absence of mass, all terms in the Hamil- 
tonian scale like an inverse length. Hence, under the scaling transformation 
WX) VNPw(yX), Rte yR and A(x) > y~!A(yx), H transforms to 
yA for y > 0. In particular, the ground state energy Eo of H satisfies (for all 
N, M,« and all choices of the Z;) 


E9=0 or Eg=-o. (8.5.1) 
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(As defined in Chapter 3, we understand EF to be the infimum of (yw, H yy) over 
all choices of y with the right symmetry, all choices of the nuclear coordinates 
R and all vector potentials A.) This also means that stability of the first and the 
second kind coincide in the relativistic case, as we said. This situation is not 
changed by the introduction of a mass, as explained in Remark 8.5 below. 

We shall offer two proofs of relativistic stability. Neither one gives optimal 
results — which are recorded in Section 8.7. In this and the next section we shall 
prove the following theorem. 


Theorem 8.1 (Conditions for Relativistic Stability). Let m = 0. For any anti- 
symmetric wave function with q spin states, and any vector potential A, 


N 
( yo |p + VeACn)| ‘) +a(p, Vow) >0 


i=1 


as long as three conditions are satisfied: 
° Za < 2/n, with Z = max{Z;} 
© 1.68a < (1 —aZn/2) Kq 


7 (aZ)a 
[1 — aZn/2) Kq7'/3 — 1.68a}3 


7.6045 G) +4] < (4/3). 
(8.5.2) 


Here, K is the optimal constant in the relativistic kinetic energy inequality 
(4.2.11), which is known to satisfy K > (6.08)~1/33(6zr7)!/3 = 1.60. For this 
value of K, the three conditions (8.5.2) are satisfied if 


1 Z 
ae > 4 2.2566 g/3Z2/3 + 1.0307 ql/3. (8.5.3) 
a 


Remarks 8.4. We note that condition (8.5.2) is fulfilled, for any fixed value of 
Za < 2/z, if a is sufficiently small (a has to go to zero as Za — 2/7). On 
the other hand, for the physical values g = 2 and a = 1/137, Ineq. (8.5.3) is 
satisfied for all Z < 58.5 and hence stability holds for these parameter values. 
This number should be compared with 2/(77@) © 87.2, which will be achieved 
later (see (8.7.1)). 

Note that if 1.68a > Kq~!/? then the conditions of Theorem 8.1 cannot be 
satisfied for any choice of Z > 0. 
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We can summarize the conclusions of Theorem 8.1 by saying that for any fixed 
value of v < 2/z there is an @,(v) such that stability holds whenever Za < v 
and a < @,(v). Moreover, the lower bound on @,(v) derived in Theorem 8.1 
depends on q according to a, ~ q~! for large g. Although our lower bound on 
a-(v) unfortunately goes to zero as v — 277, we shall see in Section 8.7 that the 
actual w,.(v) does not go to zero as v > 2/7, even with an arbitrary magnetic 
field. In Section 8.8 we shall see that the large g asymptotic behavior a, ~ q7! 
is, in fact, correct. This is not to be confused with the behavior of the critical a 


or fixe as opposed to fixe a), whichis g”’/°,notq. 
for fixed Z (as opposed to fixed Za), which is g~!/7 ie 


The heart of the proof of Theorem 8.1 is the following lemma, which will also 
be useful later in Chapter 13. 


Lemma 8.5 (Energy Bound with the Nearest Nucleus Potential). For fixed 
R=(R,,..., Ry), let D(x) = min; |x — R;|. Let e and d satisfy e > 1d/2 > 
0. For any normalized, fermionic wave function wy € ai L?(R3; C1), and any 
vector potential A, 


al Xr Ma | 
(¥ > (« [pi + VoA(xi)| — =) “) > CoS dX Dy’ 


(8.5.4) 


ce \4 1 
C= (3) [7.63 (5) +4] ai (8.5.5) 


and K as in Theorem 8.1, As before, Dj = (1/2) mingz; |Rx — Rj;| denotes half 
the distance of R; to its nearest neighbor nucleus. For K = (6.08)—!/3 3(6n7)1/3 
we have C = 1.514. 


with 


Proof. Without loss of generality we can set e = 1. For 0<A <2/z, we 
split the kinetic energy into two parts, |p| = (1 — 2A/2)|p| + (7A/2)|p|. After 
applying the relativistic Lieb-Thirring inequality (4.2.6) to the first part and 
Lemma 8.4 to the second, we obtain 


N 
(¥ So |p + Vor ACx:)| ‘) 


i=1 


> = 1r/9Kqn"? f ay(ayax + 12/2 (/Oy. |p|. /Qv)- (8.5.6) 


R3 
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In order to reduce the problem with many nuclei to the one with just one 
nucleus, and hence reduce the problem of stability of the second kind to stability 
of the first kind, we have to localize the last term in (8.5.6). Note that the 
M balls of radius D; centered at R; are non-overlapping. Hence Lemma 8.1! 
implies the lower bound 


2 
(Joy, Pl. /Oy) = ae Hy Vav(e) — Vav) dxdy. (8.5.7) 


|x — y|4 
|x—Rj|<Dj 


ly—Rj|<D; 


With the aid of Lemma 8.3, we can further bound the right side from below as 


(Joy. |p|, Ov) 


M 
32 / avtx) yt / ¥(\x — Rjl/Djoy(x)dx |, 
~ eS |x — Rj Dj —_ 

J=VN [x Ril<D, . eo RiSD, 


(8.5.8) 


with Y defined in (8.3.2). After multiplying this inequality by 24/2, we see that 
the singularity of the right side at the nuclei is exactly A/|x — Rj;|, whichis what 
we want in (8.5.4). In fact, by combining the bounds (8.5.6)-(8.5.8) we obtain 


. 1 
( S> |p + VeA(x:)| ‘) > ‘| (a +c) Qy(x)dx 
i=l 


+(1—7d/2)Kq"'° / oy(x)*Fdx, 


R3 
(8.5.9) 
where U/ is a bounded function, which takes the value 
—|x — R;\"! if |x —R;| > D; 
U(x) = , ; 4 J 8.5.10 
(*) —(1/2Dj)¥(\x — Rj|/Dj)_ if |x — Rj| < D; 


in the Voronoi cell ; = {x € R? : |x — Rj| < |x — R;| for all k F j}. 

We are left with deriving a lower bound on the two terms containing U/ 
and o in (8.5.9). For convenience, let A = 4Kq7!/3(1 — 14/2)/3. Using 
Holder’s inequality, it is easy to see that 


7A f entey'Pax +4 f Ucrey(x)dx > ~p {Pot (8.5.11) 
R3 R3 
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From the definition (8.5.10) of ¢ we can bound 


i \U(x)|4dx 


M 4 
1 


j=l I ie Ril<D; |x—Rj|>D; 
Ji— J Jia" J 


(8.5.12) 


The last integral equals 47/D. The first integral is 


Y(|x — Rj|/Dj)"dx = D; / Y(|x|)"dx ~ 7.63 Dj. (8.5.13) 
|x—Rj|<Dj |x|<1 
Hence we arrive at the lower bound 
4 


r m\4 =. il 
G51) 2-75 [7.5 (5) +4] Se 


i] J 


= 


which, in combination with (8.5.9), proves the lemma. | 


The estimate above can be improved slightly. In fact, the 47 in (8.5.5) can be 
replaced by 37, as argued in [122]. This can be achieved by an improved bound 
on the integral in (8.5.12) in the region |x — R;| > Dj;, by taking into account 
the restriction of x to the Voronoi cell I";. The same applies to Eq. (8.5.2) in 
Theorem 8.1. 

With the aid of Lemma 8.5, the proof of Theorem 8.1 is now quite short. 


Proof of Theorem 8.1. As discussed in Section 3.2.3, we can assume that Z; = 
Z for all 1 < 7 < M without loss of generality. To bound the potential energy 
from below, we use the same strategy as in the non-relativistic case, Eq. (7.1.3)— 
(7.1.8). In the relativistic case considered here it is important, however, to retain 
the positive last term in (5.2.6). More precisely, by combining the electrostatic 
inequality of Theorem 5.3 and the exchange estimate of Theorem 6.1, we obtain 


(yr, Vo) = 1.68 / oy (x)*3dx — I= 


1 
—— 0,,(x)dx + - 32 a = 
R3 R3 / 


D(x) 
(8.5.14) 
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Here, D; = (1/2) mingz; | Rx; — Rj;| denotes half the distance of R ; to its nearest 
neighbor nucleus and hence equals the distance of Rj; to the boundary of I;, 
while D(x) is the distance from x to the closest nucleus, i.e., O(x) = min; |x — 
R jl. 

In order to control the negative exchange term, we can apply the relativistic 
kinetic energy inequality of Corollary 4.1, which implies that 


1.68 aq!/3 


N 
7 ( > |p; + VaA(x;)| 7 — 1.680 f oy(xy'dx > 0. 


R3 
(8.5.15) 


We are thus left with the problem of deriving a lower bound on 


~ Za 
= U3 K-!) |p, y| 
(xc 1.68 aq'°K~') |p; + /aA(x;)| a5) ¥ . (8.5.16) 


Under the assumption that 1 —1.68aq!/7/K > Za/2 we can apply 
Lemma 8.5 to conclude that 


(Za)*q = 1 
8.5.16) > —C ; 8.5.17 

( )2 =n Zap2 — LeRag®RP LD; ( 
with C given in (8.5.5). In combination with (8.5.14) this proves positivity of 
(w, Hy) and hence stability as long as 


’ 


(aZ)*q p Za 
(1 —aZx/2—1.680g'/3/K) 8 
which is equivalent to (8.5.2). | 


Remark 8.5. In the previous theorem, we proved stability of relativistic matter 
in the massless case, in which the kinetic energy of particle of momentum p 
is equal to |p + ./aA(x)|. As already pointed out in Eq. (8.2.12), the simple 
inequality \/ p? + m2 — m => |p| — m shows that stability also holds in the case 
of non-zero mass whenever it holds with zero mass. The bound on the energy 
obtained in this way is not very good, however, at least in the case of small Za. 
It is simply —mN, which equals — AN in our units. This is the rest mass energy 
mc of all the electrons, which may be huge compared to the expected energy 
which is of the order (Za)*. A better bound can be obtained by including the 
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mass in the appropriate Lieb-Thirring inequality, which was done in [37] (see 
also [122].) See the discussion at the end of Section 4.1. 


Remark 8.6 (Stability of Relativistic Matter Implies Stability of Non-Rela- 
tivistic Matter). The elementary inequality 


2 

sP > a|p| — s 
which holds for any a > 0, allows one to deduce stability of non-relativistic 
matter from stability of relativistic matter. If we choose a big enough such that 
both a/a and Za/a are small enough to satisfy the stability criteria of Theo- 
rem 8.1, we conclude that the non-relativistic Hamiltonian (7.0.1) is bounded 
from below by —(a*/2)N. Compared with Theorem 7.1, this has the advantage 
of yielding a bound that depends only on N and not on M. We shall discuss this 
question further in Chapter 12. 


8.6 Alternative Proof of Relativistic Stability 


In this section, we shall demonstrate an alternative proof of relativistic stability of 
matter. It is closer to the proof in [138] and uses the refined electrostatic inequality 
in Theorem 5.5 instead of using the simpler Theorem 5.3 and the Lieb—Oxford 
inequality (6.2.14), which were employed in the proof in the previous section. 
Although not yielding better stability bounds, this alternative method of proof 
will turn out to be useful later in Chapter 11 when we discuss the stability of 
non-relativistic quantum electrodynamics. 

We proceed as in the proof in the previous section, but instead of using (8.5.14) 
we use Theorem 5.5, which states that 


7 M 


Z 
(yh, Ve #) = — / (a + P'(n)) od ea ears D5; 
R 


=1 


for any 0 < A < 1, where F*(x) = W*(x) — Z/D(x) which, in the Voronoi cell 
I’;, takes the value 


3D;'(1— D;*|x — Rj*)-! for |x — Rj| < AD; 


F*(x) = 
(V2Z +3) ke - Ril"! for |x — Rj| > ADj. 


(8.6.1) 
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After following the same steps as in the proof of Lemma 8.5 in the previous 
section, we conclude that 


(¥ (> |p; + Ja A(x;)| ave) “) 


i=l 
Za “ 1 
Xr 4 
> ~igs | Men+ (x)fdx + == dD, 
where A = (4/3)(1 — wZm/2)Kq7"°, and U is defined in (8.5.10). Using the 
triangle inequality for the L* norm, we have 


1/4 a4 


[ec + Praytax < [ecorter + [iPtcrtax 
3 R3 3 


The L‘ norm of / was already estimated (8.5.12)—(8.5.13), where it was found 
that 


\4 sas 1 
i: U(x)|4dx < [7.5 (5) + ‘| 2s ry, 
R3 JF . 


To bound the L* norm of F’, we split the integral over each I’; into a part where 
|x — R;| < AD; and a part where |x — R;| > A.D;. For the former, we obtain 


Xr 
7m 4 = 
IP-@itde = if T= aoa 


h 
wd t wz 1 
< — dt = —1). 
4D; (1 — t?) 24D; \d - A2)3 
: 


For the latter, we bound 


4n 4 
|F*(x)|*dx < / |F*(x)|\4dx = —— (V2Z41/2) . 
AD; ( ) 


PjM|x—R j|=AD;} {|x—Rj|2ADj} 


156 Stability of Relativistic Matter 


Summing up the contributions of all the Voronoi cells, we conclude that 
ae 
dre 14 
[ive + Peoter = MOY 7. 
R3 J= 


where A,(Z) is given by 


woe (Sty fe) 


24 \d —a)3 
1374 1/4\ 4 
+|7063 ic raz" ). 
Stability thus holds if 
Z (4\? K3 Zan \* 
amin A,(Z) < ca (5) 7 (1 = =) (8.6.2) 


a 
If we set g = 2 and a = 1/137, use the known bound K > 1.60 and com- 
pute the minimum over A numerically, we have stability as long as 0.0014 < 
Z < 43.3. If we invoke, in addition, the monotonicity argument explained in 
Section 3.2.3 in Chapter 3, this method proves stability if 0 < Z; < 43.3 for all 
l<j<M. 
For later use in Chapter 11, we display explicitly the main inequality proved 
in this section. 


Lemma 8.6 (Improved Bound with the Nearest Nucleus Potential). Let a, 
Z and q satisfy the stability condition (8.6.2). For all 0 < X <1, all vector 
potentials A and all nuclear coordinates R = (R,,..., Ry), 


(1 as 
(\ a (< |p; + VaA(x:)| — wer) ‘| >-—S — | (8.6.3) 
i=l 


for all normalized, fermionic wave functions w € Ke L?(R°; C2). The function 
W* is defined in (5.4.1). 


8.7 Further Results on Relativistic Stability 


The proof we have given of the stability of relativistic matter was chosen for its 
simplicity and for its direct connection with the various fundamental inequalities 
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proved in earlier chapters. The emphasis was on trying to present, to the extent 
possible, a unified picture of non-relativistic and relativistic stability. 

The actual historical path was quite different. As time went on better results 
were obtained, and some of them will be discussed here. Indeed, it was only in 
2007 that stability for values up to Za = 2/7 with arbitrary magnetic fields was 
finally obtained by Frank, Lieb and Seiringer [71]. 

Perhaps the earliest result was that of Daubechies and Lieb [38] in 1983 
that relativistic stability holds for one electron and M > 1 nuclei provided 
Z ja < 2/zm for all j and a < 1/37. It was also shown that stability of the first 
kind fails if a is large and M = 2 (and hence for N electrons and 2N nuclei). 
This appears to be the first mention of the need for a bound on a, and we discuss 
this M = 2 case further in the next section. 

The first proof of stability for all N and M is Conlon’s [32] which gave stability 
if Z = 1,q =1anda < 10-*”. Despite this very poor constant, Conlon’s 1984 
paper was a major achievement and showed that stability had an affirmative 
solution — which was not beyond doubt at the time. His paper was followed in 
1986 by Fefferman and de la Llave’s proof [60] of stability for Z = 1,g = 1 
and a < 1/(2.067). 

In 1988 Lieb and Yau [138] proved two stability theorems. The simpler one 
showed stability for all Za < v <2/m and a <a,(v), but with a.(v) > 0 
as v — 2/s. This simpler proof can easily be extended (although this was not 
stated explicitly) to include a magnetic field. The more complicated theorem was 
valid all the way up to Za = 2/z for a < 1/(47q), which includes the physical 
case g = 2, a = 1/137. Unfortunately, the proof of this second theorem does 
not generalize easily to include a magnetic field. The problem of modifying the 
proof to include a magnetic field without weakening the Za < 2/z condition 
was finally solved in [71], but at a price of reducing a: For the Hamiltonian 


N 
H =) |p; + VeA(w,)| + V(X, R), (8.7.1) 
i=1 
stability holds for all A(x) if Za < 2/m and qa < 65° 


This just barely covers the physical case q = 2 and a = 1/137. 
The 1996 paper by Lieb, Loss and Siedentop [122] is the one that takes the 
simplifying track that we followed in Section 8.5. 
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The necessity of a being small in all these results is not an artifact of the 
method. This was already clear from the early result in [38], as was discussed 
in Section 8.1.1. One can show that for a large enough, there is instability for 
large enough M, irrespective of how small Za might be. Since Z > | in nature, 
this result may not seem to be of practical importance (because the condition 
Za <2/m already implies the need for a bound on a, namely a < 2/rr), but it 
is important conceptually. It says that relativistic stability of matter necessarily 
implies a bound on the value of the fine structure constant a. The instability for 
large a was proved in two theorems in the Lieb—Yau paper [138], and we outline 
these two results in Theorem 8.2 below, along with a summary of the known 
stability theorems. 


8.8 Instability for Large a, Large g or Bosons 


The question addressed in this section is whether instability occurs if @ is large 
enough, regardless of what Za > 0 might be. For one electron and one nucleus, 
i.e., a hydrogenic atom, only Za appears in the Hamiltonian and hence no 
separate bound on q@ is needed for stability. The simplest non-trivial case to 
consider, therefore, is one electron and two nuclei. As stated above, this was the 
first case in which instability was noted [38]. Suppose the two nuclear charges 
are separated by a distance R = |R, — R2|. Let E(R) be the ground state energy 
of 

Z\a Za 
Ix — Ril |x — Rol 
By scaling, E(R) = C/R for some constant C < 0, depending on Z,a@ and 
Z2a. Now, if (Z; + Z2)a < 2/m, then C = 0. If, however, (Z; + Z2)a > 2/7, 
then C <0, since we can find a wave function w with (w, |plw) < (Z,; + 
Z2)a(y, |x|~!w) and put the two nuclei arbitrarily close to the origin to make 
the energy of yw negative. As long as Zja@ < 2/m and Z2.a < 2/r,, C is finite, as 
can be easily deduced from Lemma 8.3. 

In opposition to this negative energy C/R, which attracts the nuclei, there is 
the nuclear repulsion, which equals 


Z\Zoa — (Zya)(Zoa) 1 
R at R 
Hence, for fixed Zia and Z2a, we can make a big enough so that this nuclear 


|p| — 


repulsion is smaller than |C|/R. For this value of a, the ground state energy Eo 
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of one electron and two nuclei is thus negative. Hence Ey = —ow by scaling, as 
remarked earlier. 

The same argument works for any number of nuclei. One electron will cause 
instability (of the first kind) if a is too large. The number of nuclei needed 
for instability will increase as Za — O (because we require the total nuclear 
charge eal Zx to exceed 2/(zra)) but the important point is that there is a fixed 
number a, such that instability always occurs (with sufficiently many nuclei) 
when a > a@,, regardless of how small the values of Z;a may be. 

In combination, Theorem 8.1 in Section 8.5 together with its improvements 
discussed in Section 8.7 state that the relativistic Hamiltonian (8.7.1) is stable, 
1.e., non-negative, for all M, N and all magnetic fields if the condition 


= 


Z 
= + 2.2566 q!/3Z23 + 1.0307 q'3 


or the condition 


qa < 665 and Za < 2/1 


is satisfied. For comparison with the following instability theorem, 8.2, it is 
convenient to reformulate the stability conditions as follows. There is stability if 
all of the following three stability conditions hold: For appropriate C > 0 and 
Cc’ > 0, 


(1) Z;a < 2/z forall j =1,...,M, 
(2) (Zja)’a < C/q forall j =1,...,M, 
(3) a <C'q7'3. 


That this simple set of stability conditions (almost) correctly describes the true 
state of affairs is the content of the next theorem, proved in [138, Thms. 3 and 4]. 


Theorem 8.2 (Conditions for Instability). There are constants D < 115120 
and D’ < 128/(152) such that the Hamiltonian (8.1.1) is unstable of the first 
kind for large enough N and M if one of the following three conditions 
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holds: 


(i) Zja >2/n for some j, 
(ii) (ZZja)’a > D/q forall j, 
(iii) a > D’. 


As will be demonstrated in the proof of Theorem 8.2, under condition (ii) it 
suffices to have only q electrons to cause the collapse. Under conditions (i) and 
(iii) only one electron is needed for instability! 

The three instability conditions (i), (ii) and (iii) in Theorem 8.2 are (almost) 
the negation of the three stability conditions (1), (2) and (3) displayed above, 
except for the numerical values of the constants, and the factor g~!/? in the third 
condition. For fixed Z both the stability and the instability bounds show that the 
critical w is of the order g~'/? for large g. But if we want a bound that is uniform 
in Z (valid for arbitrarily small Z) is the critical a of order 1 or of order g=!/3 ? 
We conjecture that the former is correct. 

We believe that the factor g~!/? should actually not be part of the stability 
condition (3); that is, a < C’ for some constant C’, together with stability 
conditions (1) and (2), should be sufficient for stability. This is an open problem. 
We note that if the Lieb—Oxford inequality used in the proof of Theorem 8.1, 
Eq. (8.5.14), held with a factor g~!/? multiplying the ‘i o*/? term, our proof of 
Theorem 8.1 would actually yield the condition a < C’ instead of a < C’q7!/. 
This factor would naturally occur from considerations of the homogeneous 
electron gas; see the discussion in Chapter 6, Eq. (6.2.11). As noted there, a 
general exchange inequality cannot hold with this factor g~!/7, however, but our 
conjecture is that it holds with g~!/? for the wave functions actually relevant for 
causing instability. 

Note that if we define a,(v) as the critical value of a for stability to hold for 
all Za < v, then the stability and instability results imply that a.(v) ~ 1/q for 
large g, as mentioned in the Remark 8.4. 


Proof of Theorem &.2. We choose A = 0 and, without loss of generality, set 
m = 0. Instability under condition (i) has already been shown in Section 8.2. 
We shall now prove instability under condition (iii). We take only one species 
of nuclei, i.e., we set Z; = Z > Oforall 7 = 1, 2,..., M. Recall that it suffices 
merely to show that the energy can be made negative, after which it can be 
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driven to —oo by length scaling. For simplicity of the notation, we shall ignore 
the spin degrees of freedom here. Alternatively, think of either g = 1 or all spin 
components but one be zero. Pick a @ € L?(R?) with ||¢||2 = 1 and with finite 
kinetic energy, that is, tT = (¢, |p|¢) < oo. Let N = 1. Then 


1 
(He) He = Za a dx+Za > moe 88) 


1<k<l<M 


For a given ¢, we try to choose the nuclear positions R; so as to make this 
expression as small as possible. For an upper bound on the smallest possible 
value, we can average the expression (8.8.1) over the nuclear positions, with 
weight given by [lh 1 \f(R,,)|’. In this way, instability is established if it can be 
shown that 


1 
= (Zam = 5Z/aM(M = ») 2 
where Z denotes 


2 2 
eS) lox)" le) diay, 


|x — y| 
R3xR3 
For a given value of Z we can choose M such that |M — 1/2 — 1/Z| < 1/2. 
Then 


ZoM +472 M(M js Z’a|M 1_4y | 2 (1+ Z) 
a 5) a = 5 a 5) Z 4 a a 


<-<xa. 


Instability thus occurs for a > 2t/Z. To obtain a numerical value for t and Z, 
we choose $(x) = 2~!/? exp(—|x|). Then t = 8/(3zr) and Z = 5/8, and hence 
2t/Z = 128/(15z). 

Finally, we show that condition (ii) suffices for instability. We take N = q 
electrons, and choose the wave function to be ee , P(x;) times the totally 
antisymmetric spin wave function given by x(a) = (N!)~ det 9; (o,)|* jae 
where g;(o) = 6,,. With the same notation as above, the energy of this state is 
given by 


Nr +aW0N, R) 
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where 


ool? 
me Y fom 


1 
+Z ee Se ar i+ 5 NW - Iz. 
Here, the last term is the repulsive interaction among the electrons. 
When discussing the instability of non-relativistic bosons in Chapter 7, 
Section 7.6, we already encountered the expression W(N, R) and showed that 
it is possible to choose the nuclear positions R in such a way that 


W(N, R) < —CZ°7N%° 
for some constant C > 0 depending only on @. Therefore, instability occurs if 
ge = Caz 9 20, 


or 
3] 
(Za)a > (=) — 
C/ q 


The numerical value for t/C depends on the choice of @. We refer to [138] for 
details of an appropriate choice of ¢ and a careful calculation of the constant C 
(which leads to the stated value of D). | 


We now turn our attention to relativistic stability for bosons. Recall that in 
the non-relativistic case discussed in the previous chapter, bosonic particles are 
unstable of the second (but not the first) kind, that is, the ground state energy 
goes like —N*/? in the case of static nuclei. In the relativistic case, the situation 
is even worse, in the sense that even stability of the first kind fails for large 
enough particle number. 


Theorem 8.3 (Instability for Bosons). For every a > 0 and Z > 0 and for 
sufficiently large M and N (depending on a and Z), the fundamental Hamilto- 
nian (8.1.1) for N bosons is unstable of the first kind as long as Z; = Z for all 
l<j<mM. 


Proof. As pointed out in Chapter 3, Section 3.1.3.1 and Corollary 3.1, the ground 
state energy of N bosons is the same as the one for NV fermions with g = N spin 
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states for each particle. Hence by Theorem 8.2, condition (ii), there is instability 
for each fixed Z and a if g = N is large enough. | 


Remark 8.7. The instability result in Theorem 8.3 refers to static nuclei. If 
the nuclei have a finite mass and their kinetic energy is taken into account, the 
stability situation is a littke more subtle. The nuclei can be either bosons or 
fermions. If they are fermions we can apply the stability theorem, 8.1, to the 
positive nuclei instead of the negative bosons and see that stability is restored in 
an appropriate parameter regime. On the other hand, if some of the nuclei are 
bosons, then instability always occurs. This follows from the consideration of 
the non-relativistic case in the previous chapter, where it was explained that the 
energy of non-relativistic bosons goes like —N7/° for large N. To apply this to the 
relativistic case one simply notes that | p| < (p? + 1)/2, and hence the ground 
state energy in the relativistic case is bounded from above by +N — CN’ for 
some C > 0. It is thus negative for large enough WN and, therefore, equals —oo 
by scaling, as explained in the beginning of Section 8.5, Eq. (8.5.1). 


CHAPTER 9 


Magnetic Fields and the Pauli Operator 


9.1 Introduction 


In the preceding chapters we proved the stability of matter for electrons 
(i.e., fermions) interacting with fixed nuclei, both relativistically and non- 
relativistically. We showed that the proofs go through even in the presence 
of magnetic fields, meaning that one can replace p by p+ ./aA(x) for any 
choice of the vector field A. Indeed, the diamagnetic inequalities discussed in 
Section 4.4 show that the bounds obtained for the ground state energy can only 
improve when A is included (but this does not imply that the true ground state 
energy increases). 

In real life the energy of matter actually decreases significantly when a field 
is added and it can decrease to —oo if the field is allowed to become arbitrarily 
large. The reason is that the electron spins interact with the magnetic field, and 
hence an additional term must be added to the kinetic energy to account for 
this interaction. What prevents the field from becoming large, and the energy 
becoming arbitrarily negative, is the positive self-energy of the field (i.e., the 
energy needed to create the field). The goal is to show that the additional 
negative interaction energy is balanced by the field energy and an ‘equilibrium’ 
is reached in which the total energy is finite. We shall see that this is achieved 
if and only if @ and Za? are not too large — even in the non-relativistic case. 
Thus, the non-relativistic Hamiltonian we shall treat in this chapter mimics the 
relativistic one we treated in the previous chapter in the sense that a bound on a@ is 
needed. 

Only the non-relativistic model of particles with spin in the presence of 
magnetic fields will be treated in this chapter. The discussion of relativistic 
electrons together with magnetic fields will be deferred to Chapter 10 where the 
Pauli operator, treated here, will be seen to be the non-relativistic limit of the 
Dirac operator. 
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9.2 The Pauli Operator and the Magnetic Field Energy 


We recall the definition of the Pauli one-body kinetic energy operator given in 
Chapter 2, Eq. (2.1.45), 


(p + Va@A(x))” + ves - B(x). 


wl re 


1 
Ta(p) = 5 |o -(p + JVaA(x))|" = 


(9.2.1) 


The wave functions on which this operator acts lie in L?(R*;C* @ C1). We 
allow for g internal states (often called ‘flavors’ ) in addition to the two coming 
from the spin 1/2 nature of the particles. In order words, a wave function 
of one particle is a C* valued function (instead of the usual complex-valued 
function) of z = (x, 0), withx € Re andl <o < q, and the number of internal 
states, altogether, is 2g. With this convention, we can write the quadratic form 
corresponding to (9.2.1) as 


_ 
(hr, Tap) W) = 5 / \(p + Va A(x)) w|? dz + ve [vo - B(x))dz. 
(9.2.2) 


As explained in Chapter 2, | - | stands for the norm of a vector in C?.and{-, +) 
stands for the inner product on C?. Recall that the sum over o is contained in 
the dz integral, as in (3.1.15). 

We assume that A € Le (R? ; R3). The magnetic field B(x) is given as B(x) = 
curl A(x), which has to interpreted in the sense of distributions if A is not 
differentiable. We also assume that Emag(B) < 0, ie., B € L?(R3; R°). Recall 
that we choose units such that the electron mass, f and the speed of light all 
equal 1. 

When discussing the question of stability of matter for arbitrary magnetic 
fields it is important to consider the energy cost of creating such a magnetic 
field, namely the self-energy of B. As discussed in Chapter 2, Section 2.1.2, in 
our units this self-energy is given by 


I 
Emag(B) = = i |B(x)|?dx. (9.2.3) 
R3 


That is, when minimizing the energy over all possible wave functions w and 
magnetic fields B(x), it is important to add the self-energy in order to obtain a 
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finite result, as we shall see later. This was not necessary in Chapter 7, where 
the o - B term in the Hamiltonian was absent. The minimal energy defined in 
this way is called the ground state energy of the system. Stability of the first 
and second kind always refer to this energy of particles and field. 
The last term in (9.2.1) can alternatively be written as 
Ja 


= gg. 
“7 0 Bix) = Va5s B(x) 


where S = 0/2 is the electron spin, and g = 2 is the gyromagnetic ratio (or 
Landé g factor) of the electron. It is possible to study the Pauli operator (9.2.1) 
with a factor g/2 in front of the last term for general values of g, not just g = 2. 
One observes the following, however 


e The sign of g is important. If g < 0, it is not clear whether 


Jag 


2 (p+ J@A(x))” + To Be) (9.2.4) 


2 
is positive for all magnetic fields B(x). 

e For 0 < g < 2, the operator (9.2.4) can be written as a convex combination 
of (9.2.1) and (p + JaA(x))? and is thus positive. Moreover, by employing 
this fact the analysis of stability of matter can be reduced to the g = 2 case. 
Recall that in Chapter 7 we have shown the stability of matter in case g = 0, 
for any values of Z and w and any magnetic field. 

¢ For g > 2, (9.2.4) is not positive, in general. Even worse, the energy of one 
electron and one nucleus can be driven to —oo by increasing B, even if the 
self-energy of the magnetic field is taken into account! There exist particles 
in nature with g factors bigger than 2, or even negative. These are composite 
particles, however, like protons (g = 5.6) which are made out of three quarks. 
When analyzing the stability of matter with such particles, one can not ignore 
their finite size and internal structure. 


9.3. Zero-Modes of the Pauli Operator 


An important role in our analysis will be played by zero-modes of the 
Pauli operator (9.2.1), that is, a pair (W(x), A(x)) with w € L?(R?;C’) and 
curl A(x) € L?(IR*; R%) such that 


o-(p+ JaA(x))W(x) = 0. 
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Such a y is an eigenfunction of the Pauli operator (9.2.1) with eigenvalue 0. 
In addition, we demand that the magnetic field energy (9.2.3) of B =curlA 
be finite, and we stress that w must be square integrable. In other words, for 
special magnetic fields there is a genuine normalizable (and, therefore, time- 
independent) state with zero kinetic energy. Such a state does not exist when 
A = 0, i.e., for the Laplacian alone. There is no non-zero f € L?(R¢) such that 
Af =0. 

The importance of zero-modes for stability was recognized in [74], but their 
existence was uncertain. The first examples were given by Loss and Yau in [139] 
and the first one cited in that paper was 


3 
Ja A(x) = “Gay [dl —x?)w+2(w-x)x+2wax] = (9.3.1) 
where w = (0, 0, 1). The magnetic field in this case equals 
4 
B(x) = curl A(x) = ——~A(x). 9.3.2 
(x) = curl A(x) foe (x) (9.3.2) 


The corresponding spinor-valued function y € L7(R*; C7) with zero kinetic 
energy is 


w(x) =(14+ x7)? [1 +i - x] ia) . (9.3.3) 


Many more zero-modes were found later [1, 48, 45, 6], but their complete 
classification is still unknown. 

Notice that, by scaling, any zero-mode gives rise to a one-parameter family 
of zero-modes. This observation will be useful in the sequel. That is, if (yw, A) 
is a zero-mode then so is (y,, A,), for all A > 0, where 


Wilx) = 7? WAx), Ax(x) = AA(AX), (9.3.4) 
and 
By(x) = BAX),  Emag(By) = AEmag(B). (9.3.5) 


The scaling of yy, is chosen such that ||W|/2 = || vl. 

As in the example above, the dependence on a is rather simple. If (w, A) is 
a zero-mode for a = 1, then (y, aal/ 2A) is, by definition, a zero-mode for any 
a > 0. Under this transformation B also gets multiplied by a~!/?, and hence the 


field energy Emag(B) gets multiplied by a~!. 
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9.4 A Hydrogenic Atom in a Magnetic Field 


Before studying the question of stability of matter with the Pauli operator in 
full generality let us consider the simplest case of one electron in the field of 
one nucleus of charge Z. In this case we shall already see that stability can only 
hold if the field energy is taken into account. The Hamiltonian, including the 
magnetic field energy, is given by 
Za 
Hy = T,(p) - ia + Emag(B). (9.4.1) 
The fact that the field energy is necessary to stabilize the system is an immedi- 
ate consequence of the existence of zero-modes discussed in the previous section. 
In fact, with (w, a'/2A) being a zero-mode of T4(p) and with (Wy, a '/2A,) 
scaled as in (9.3.4), we have 


(Wa, Hy-ing, Wa) = —Za(Wr, 1X1! Wa) + Emag(o'/?B,) 
= —) Za(w, |x|7!W) + ~ Emog(B). (9.4.2) 


Two things can be seen from this expression. First, if we omit the field energy, 
we can drive the energy to —oo by increasing A to infinity. Second, Eq. (9.4.2) 
shows that there is a critical value of Za? beyond which the infimum (over y 
and A) of the total energy is —oo. 

Our goal in this section is to show that the critical Za” is not zero. We follow 
closely the discussion in [74]. In fact, we shall show the following. 


Theorem 9.1 (Stability of Hydrogen for Small Za”). Let H,4 be given in 
(9.4.1). Under the condition that 


3\3/2 
Zo? < > (=) (9.4.3) 


we have 
(W, Ha) = (Za), v) 
for any w € H'(R*; C’) and for any magnetic vector potential A(x). 


Recall from (2.2.17) that if the Zeeman term o - B is omitted the infimum of 
(wW, Haw)/(w, w) (over all w and A) equals —(1/2)(Za)’. 
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The theorem states that a hydrogenic atom is stable if Za? is not too large. 
For the physical value aw = 1/137, the critical Z is seen to be at least as big as 


a2 (3)? ~ 19 160. 


The following proof is somewhat convoluted, but at present there does not 
seem to be any straightforward, simple path to the conclusion. 


Proof. We can assume that (Ww, w) = 1. Since T4(p) => 0, we have the lower 
bound T4(p) > ¢ Ta(p) for any 0 < ¢ < 1. Moreover, o - B(x) > —|B(x)|. 
Hence, for any w € HR? ;C?), 


Gh, Hat) = (v.[5@ + Vea? — 5 Va] BOx)| - Zale!" |) + Enag(B)- 


For given |y|*, we can minimize the second and fourth terms with respect to 
|B(x)|, which yields 


2, 
Emag(B) — 5a i Ive) P| Bex)idx = / IWx)Mae. 
R3 R3 


On the other hand, the diamagnetic inequality (4.4.4) implies that 


(Wy, (p + VaA(x))y) = i IV w(x)||?dx. 


Thus, we are led to investigate a lower bound for the quantity 
é 9 Za 2 aerr 4 
5 f IVIv@)Ildx — pe! dx — —— J Iv@)"dx. 4.4) 


We are free to choose any ¢ between 0 and 1 to make this expression as large as 
possible. For given y, the optimal choice of ¢ is 


-1 
c= in| [iviveoirer [2am fiw | r (9.4.5) 


Two cases have to be distinguished. If f |V|y(x)||?dx > 2am f |W(x)|*dx, 
then ¢ = 1| and the expression (9.4.4) is bounded from below by 


1 Z 
; / Vive) |I2de — / “a iwtaPde > (Za? i Iwe)Pde. 


The last inequality is just the ground state energy of the hydrogen atom, 
discussed in Section 2.2.2. In the contrary case, where ‘i \V|w(x)||?dx < 
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2am f |w(x)|*dx, the expression (9.4.4) becomes 


1 (SIVIW@)IPdx)” pf Za 
8ra ff |W(x)|4dx |x| 


for our choice of ¢ in (9.4.5). The Sobolev inequality discussed in Chapter 2, 
Eq. (2.2.4), states that 


3422/3 oe ia 2 
5 (4x?) [veo dx < [vive dy. 


Using this and Schwarz’s inequality we obtain 


1/2 1/2 
} l(x)4dx < ( / Iva) Pa) ( / w(x) ) 

Ana? 4 ; 1/2 ; 3/2 

Z (5) za(f ve ax) (f iio ax) 


In addition, from the ground state energy of the hydrogen atom we get 


1/2 12 : 
( / Vivitar ( i oa) > a dx 
(see Eq. (2.2.18) in Chapter 2). Hence (9.4.6) is bounded from below by 
3/2 
(z (3) - za) oes 
2a \4 |x| 


The condition f |w(x)|*>dx = 1 has been used. Under condition (9.4.3) the last 
term is positive for any yy, and hence (9.4.6) is positive. a 


|w(x)|?dx (9.4.6) 


The proof of Theorem 9.1 may not be the slickest possible, but it was his- 
torically the first and has the virtue of using only well-known inequalities used 
elsewhere in this book. The reader might wonder why we threw away the non- 
negative term (1 — ¢)74(p) in the lower bound. What has been done, in fact, is 
to control the dangerous o - B term by an optimal combination of two methods; 
one way is to use —é|B| as a partial lower bound, and the other way is to use 
—(1—e)(pt JaAy as a lower bound on the remainder. The zero-modes of 
T,4(p) play an important role for the stability question. If (w, A) is a zero-mode, 
then (W, T4(p)w) = 0 and hence nothing has been thrown away, in fact. 

It was shown in [74] that the critical Za? is entirely determined by zero- 
modes; that is, the minimum of the right side of (9.4.2) is zero when taken over 
all zero-modes (yr, A) when Za? is less than the critical value; it is —oo beyond 
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that value. The complete set of zero-modes is unknown, however, and therefore 
the exact value of the critical Za? remains unknown. 


9.5 The Many-Body Problem with a Magnetic Field 


The stability of the hydrogen atom with Pauli kinetic energy and magnetic fields 
was proved in [74] by Frohlich, Lieb and Loss. The relation to zero-modes was 
also observed there. For one nucleus and arbitrarily many electrons (the large 
atom), as well as for one electron but arbitrary nuclei (and with a required bound 
on @ as well as on Za’), stability was proved by Lieb and Loss in [117]. The 
obvious next goal was to show stability for all N and M. 

A solution to this problem was given by Lieb, Loss and Solovej in 1995 
[123] for values of Za? and @ that, while not quite as generous as the bound in 
the previous section, were more than sufficient to cover all physical situations. 
Earlier in 1995 Fefferman announced stability [57] for Z = 1 and sufficiently 
small a; the proof in 1996 [58] does not contain quantitative values of the 
constants. 

The results in [123] were subsequently applied to the stability of matter 
with quantized magnetic fields (Quantum Electrodynamics) in [27], a subject to 
which we shall return in Chapter 11. The additional difficulty there is that the 
field energy is a bit more complicated than (1/87) [ |B |", but this problem can 
be overcome, as we shall see later. 

Stability requires that Za be sufficiently small (as is apparent from the one- 
body problem), but it also requires that w itself be small (independent of Za). 
Conditions for stability are given in Theorem 9.2, while the fact that a must not be 
too large in order to have stability of the first kind is the content of Theorem 9.3. 

Paper [123] contains three proofs of stability of the Pauli Hamiltonian, and 
they are all remarkably short given what seemed earlier to be the difficult nature 
of the problem. We shall give the first of these here because it most closely 
utilizes the machinery already developed in this book. Another proof, which was 
developed in [127], will also be given in this section (after Corollary 9.1), and 
this is even shorter provided one uses a useful theorem of Birman and Solomyak 
[20], which will be proved in the appendix to this chapter. Interestingly, the 
numerical constants in the four proofs in [123, 127] (.e., the required bounds 
on Za? and q@) are not very far from each other. 


The Hamiltonian under consideration is 
N 


oi 2 Ta(P;) + aVc(X, R) + Emag(B), (9.5.1) 


i=1 
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where T4(p) is the Pauli operator in (9.2.1) and the Coulomb potential Vc(X, R) 
is given in Eq. (2.1.21). As explained in Section 9.2, the wave functions are 
antisymmetric functions having 2g spin states. 


Theorem 9.2 (Conditions for Stability). Let Z = max ;{Z;}, and let 


1 Z 
= — + 2.26 (2g)! 27/3 + 1.03 (2q)'?. (9.5.2) 
K 
The ground state energy of H in (9.5.1) satisfies 
a2 
Eg = (9.5.3) 
K 
provided 
2q a 3 
i gg (9.5.4) 
K 82? 


For the physical values ~w = 1/137 and 2q = 2, this theorem proves stability 
of matter for Z < 953. 


First proof of Theorem 9.2. The first main ingredient in the proof of 
Theorem 9.2 is the relativistic stability of matter proved in Theorem 8.1 in 
the previous chapter. That is, for « given in (9.5.2), 


N 
Yo lp; + Ve A(x:)| + Vc(X, R) > 0. (9.5.5) 


i=1 


The Hamiltonian (9.5.1) is, therefore, bounded from below by 


N 
H > Yo ( Tali) — <1p; + V@AG)|) + Enag(B). 
i=1 


This is just a sum of one-body operators, plus the magnetic field energy. Let 
h @ Ica be this one-body operator, namely 


K= tn |P + JaA(x)|, 


which acts on L?(IR?; C’). On the space of totally antisymmetric wave functions, 
the lowest eigenvalue of pa (2 ® Ica); is equal to the sum of the lowest NV 
eigenvalues of h @ I[c:. 
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Let w > 0, and writeh = —u + (h + 2). The sum of the lowest N eigenvalues 
of (A + 4) @ Ica is certainly bounded from below by the sum of all the negative 
eigenvalues of (1 + 2) ® Icv. The latter can be written as 


—q j N.(hjde, 


where N.(Q) denotes the number of eigenvalues below —e of an operator O. 
For any jz > 0, the Hamiltonian H is thus bounded from below as 


—Nu-gq / N-(h) de + Emag(B). (9.5.6) 


Using the positivity of T4(p), we further estimate T4(p) > (/e)Ta(p) for 
e > mw. Moreover, 


2 
“|p — VaA(x)| < —5(p + Va Alay)? + © 
K 2ek 2 
by Schwarz’s inequality. The parameter jz is at our disposal and we choose it to 
be w = 4(a/x)*. Using, in addition, that 0 - B(x) > —|B(x)|, the operator h is 
then bounded from below sie 


h>h.= 22 (p+ Vaan)? 


To get an upper baie on the number of es of h, mie —e, we can 
use the CLR bound (the y = 0 case of Theorem 4.1) to obtain 
4 2.2 73/2 
a = |. dx. 


Neth) < Nelhe) <2Loa f ae 1- 


(9.5.7) 
R3 


As usual, [t]4+ = max{0, t} denotes the positive part. The factor 2 on the right 
side results from the 2 spin states. Recall that the best known upper bound on 
Lo,3, given in Eq. (4.1.17), is Lo.3 < 0.116. 

After integration over e, (9.5.7) becomes 


jo f[Snon- 2s 


4(a/K)? R3 


, e K a2 IT a2 
< fee [| 5r6 x)| - al dx = ae ae 


a 
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Consequently, 


820 nf 


Stability thus holds as long as (9.5.4) is satisfied. a 


2 1 2 
H>-4-N + (5 —2q Loa) i |B(x) Pax. 
K 3k 
R3 


For not too small Z our bound (9.5.3) is quite reasonable since the energy 
should be of the order of (Za)*. For small Z, however, the result is far from 
optimal. The reason for this defect is that even if Z is very small (9.5.5) does not 
hold for large «. This is a consequence of the instability results for relativistic 
matter in Theorem 8.2. 

The previous proof uses the CLR bound, which was stated in Chapter 4, 
Theorem 4.1, without proof. If one wanted to use only results proved in this 
book, one could avoid the use of the CLR bound and use Corollary 4.2 instead. 
The divergence of the bound in Corollary 4.2 for small e does not cause any 
trouble here, since the integration in (9.5.6) is only over e > yu anyway. The 
bounds on Za” and «& to ensure stability, obtained in this way, may turn out to 
be worse, however. 

For use later we shall explicitly display the following inequality, which is at 
the heart of the proof of Theorem 9.2. We call it a corollary, although it is really 
a corollary of the proof and not the theorem per se. 


Corollary 9.1 (Stability with Restricted Field Energy). Let we 
Ge L?(R?; C4) be a normalized, antisymmetric N electron wave function, 
whose density Qy is supported in the set & C IR? (which could be all of R?), and 
letk > 0. If 


then 


i=1 


1 2 
(+. [rate ~ Sia, + verse “) + gy | Benes = -4N—. 
: (9.5.8) 


The point of this corollary is that all the bounds hold even if only the magnetic 
field energy in the support of @y is utilized. An inspection of the proof of 
Theorem 9.2 shows that only the magnetic field on this support is relevant. 
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Inequality (9.5.8) will be important in Chapter 11, where the Pauli operator with 
quantized electromagnetic field will be studied. 

As promised above, we shall now give an alternative proof of Theorem 9.2. 
It is taken from [127]. 


Second proof of Theorem 9.2. We start with the same observation as in the 
beginning of the first proof. Namely, from stability of relativistic matter we 
have 


N 
a 
H > Y0 (Tali) — <1p; + VAAG)|) + Enag(B) 
i=1 
for x given in (9.5.2). Since T,4(p) > 0, we can bound 74(p) from below as 


2 
Ta(p) = AV Ta(p) — 7t 


for A > 0. Thus 


a i 
H>-Tr E Ta(p) ~ =|p + JaA(x)|] — SN + Enag(B). 


The notation Tr[O]_ simply stands for the sum of the absolute value of the 
negative eigenvalues of a self-adjoint operator O. Likewise, Tr[O]” is the sum 
of the p" powers of the absolute value of the negative eigenvalues. 

There is a useful inequality for traces of differences of fractional powers of 
operators, which was proved by Birman and Solomyak in [20]. It says that 


Tr[B — A]_ < Tr[B? — A2]!” 


for positive operators A and B. For the convenience of the reader, we present a 
proof of this inequality in the appendix to this chapter. As a result 


2 1/2 
Tr [AVTa(p) - <p + V@A(w)|| =< Tr [227400) — (p+ Yaa)? | 


Since o - B(x) > —|B(x)|, 


a IP a a 
WTa() — (Pp + Va A(e))? > (= = =) (p + VaA(x))’ — 22 Bee), 
We shall choose 2 > /2a/k. The trace of the square root of the negative part 


of this operator can be bounded with the aid of the LT inequality (Theorem 4. | 
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for y = 1/2). The result is 


2 2 
Tr I€ = <) (p+ JaA(x))? — - 2% ia] 


2q M40 5 

a | |B@Pax. (9.5.9) 
x a 

? (F — «) R? 


(Recall that there are 2g spin states.) Hence 


= Liga 


i 1 2q M40 4 
H>-—N+| —~-Lin3—*"* | | |BeoPar. 


4 8 a? 
. 4 (3 ~ a R3 
With the choice 4 = /8a/«, stability is thus shown if 


2q a p I 37 
K 82 16 Li 2,3 


(9.5.10) 


As explained in Chapter 4, Eq. (4.1.18), Li/2,3 < 6.87 Le 1/2,3 = 0.0683. With 
this estimate, the right side of (9.5.10) turns out to be the same as the right side 
of (9.5.4) up to four figure accuracy. | 


The two proofs give remarkably similar results! The second one is infinites- 
imally better and shows stability in a slightly bigger parameter regime. For the 
physical value of a and q, the first proof shows stability for Z < 953, while the 
second proof works for Z < 954. This should be regarded as no more than a 
coincidence, since there are ways to improve the results in both methods. 

Finally, we show that small @ is actually needed for stability. The following 
theorem shows this and also shows that bosons are always unstable for any value 
of Za? > 0. This is analogous to the relativistic case discussed in Section 8.8. 


Theorem 9.3 (Conditions for Instability). There is a constant D > 0 such that 
the Hamiltonian (9.4.1) is unstable of the first kind for large enough N and M 
if one of the following two conditions hold: 


(1) Z jot > (2 /2)(3/4)7/? for some j, 
(2) a> D/q'. 
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Proof. The proof follows closely the corresponding proof of Theorem 8.2 in the 
relativistic case, and is even simpler. 

Instability under condition (1) has already been shown in Section 9.4. We shall 
now prove instability under condition (2). Given the instability under condition 
(1), it clearly suffices to consider the case Z < q. Pick a zero-mode (w, aa l/ 2A) 
of the Pauli operator, with w € L7(R?; Cc), We take N = q, in which case all 
the electrons can be put in the state w (cf. the discussion in Section 3.1.3.1). That 
is, we can take the wave function to be [[#_, w(x;) times a totally antisymmetric 
function in spin space. If tT = Cl B) denotes the magnetic field energy and Z 
denotes twice the self-energy of a charge distribution ||’, ie., 


2 2 
_ i Paco Mbae) 


dxdy, 
Ix — y| 


R3 x R3 


the energy of the system is given by 


«+ Sag -)- 20 | om dx+ Za >~ 


l<k<l<M [Rx — Ril 
(9.5.11) 


We can choose the nuclear positions R; so as to make this expression as small 
as possible. For an upper bound on the smallest possible value, we can average 
the expression (9.5.11) over the nuclear positions (as we have done several times 
before), with weight given by gran |w(R,)|. In this way, the energy is negative 
if 

T 


Ta 2 
aia (—2ZaMq + Z*M(M — 1)+q(q —1)) <0. 


Since 0 < Z < q, we can pick an M such that |ZM — q| < 1/2. Then 
—2ZaMq + Z?M(M —1)+q(q—1)=(ZM—qy —Z’7M—q <1/4—q. 
Hence the energy is negative if 


> 20 
> ———_—_.. 
Z(q — 1/4) 
Once the energy is negative, it can be driven to —oo by length scaling. (See the 
discussion at the end of Section 9.3.) | 
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9.6 Appendix: BKS Inequalities 


As a convenience to the reader we give a proof of some cases of the inequalities 
due to Birman, Koplienko, and Solomyak [19]. It is taken, with minor modifi- 
cations, from [127, Appendix A]. The case needed in Section 9.5 corresponds 
to p = 2 below, which was proved earlier by Birman and Solomyak in [20]. In 
Section 9.5 we were interested in (B — A)_, but here we treat (B — A) to 
simplify keeping track of signs. The proof is the same. Recall that X,:= 
(\X| + .X)/2. 


Theorem 9.4 (BKS Inequalities). Let p > 1 and suppose that A and B are two 
non-negative, self-adjoint linear operators on a separable Hilbert space such 
that (B? — APY? is trace class. Then (B — A) is also trace class and 


THB = Aly S THB? = AP)”, 


Proof. Our proof will use essentially only two facts: X +> X~! is operator 
monotone decreasing on the set of non-negative self-adjoint operators (i.e., 
X>Y>0=5Y7!>X7!) and Xb X’ is operator monotone increasing 
on the set of non-negative self-adjoint operators for all 0 <r <1. (See the 
footnote on page 87.) Consequently, X +» X~" is operator monotone decreasing 
for 0 <r <1. We refer to [17, Chapter V] for details on operator monotone 
functions. 

As a preliminary remark, we can suppose that B > A. To see this, write 
B? = A? + D. If we replace B by [A? + D,]'/? then (B? — A”), = D, is 
unchanged, while X := B — A becomes [A? + D,]'/? — A and this is bigger 
than B — A because of operator monotonicity of the map X +> X!/? on positive 
operators. Since the trace is also operator monotone, we can therefore suppose 
that D = D,,i.e., B? = A? + C? with A, B, C > 0. Our goal, then, is to prove 
that 


Tr[(A? +. C?)'/? — A] < Tr C, (9.6.1) 


under the assumption that C is trace class. 
To prove (9.6.1) we consider the operator X := [A? + C?]!/? — A, which is 
well defined on the domain of A. We assume, at first, that A? > e” for some 
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positive number ¢. Then, by the functional calculus, and with 
E:=[A?+cC?]°-P/P and P:= AP -E 

we have 
X = E[A?+C?]— Al PA? = —PA? + EC?. (9.6.2) 


Clearly, P > Oand0 < P<e!-?, 

Let Y:= EC?. We claim that Y is trace class. This follows from 
YtY = CPE2CP < CPC2-2PCP = C2. Thus, |Y|<C, and hence TrY = 
TCPCEC? 2 TrC: 

It is also true that P is trace class. To see this, use the integral representation 
AlP =c [y(t + A?) |'t"-?)/? dt for suitable c > 0. Use this twice and then 
use the well known resolvent formula, which is 


1 1 a ge 1 
AP+t AP+CP+1 AP+t. AP+CP4t° 


In this way we find that 
[oe 
P= c fran 4 ty CP(AP +? +0) 120 PY at, 
0 


Since C is trace class, so is C?, and the integral converges because of our 
assumed lower bound on A. Thus, P is trace class and hence there is a complete, 
orthonormal family of vectors v1, v2, ..., each of which is an eigenvector of P. 

Since X > 0, the trace of X is well defined by Parr Xvj) for any com- 
plete, orthonormal family. The same remark applies to EC? since it is trace 
class. To complete the proof of (9.6.1), therefore, it suffices to prove that 
(v;, PA? v;) = 0 for each j. But this number is 4 ;(v;, A? vj) = 0, where A; is 
the (non-negative) eigenvalue of P, and the positivity follows from the positivity 
of A. 

We now turn to the case of general A > 0. We can apply the above proof to 
the operator A + e¢ for some positive number ¢. Thus we have 


Tr [[(A +e)? +C7]'/? —(A+e)] <TC. (9.6.3) 


Let @, @2, ... be an orthonormal basis chosen from the domain of A”. This basis 
then also belongs to the domain of A and the domain of [(A + €)? + C? ]'/? for 
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all ¢ > 0. We then have 
Tr X = )(y;, Xg;). 
Jj 


Note that a priori we do not know that the trace is finite, but since the operator 
is non-negative this definition of the trace is meaningful. Operator monotonicity 
of X!/P gives 


(9;, [A +6)? +C7]'? —(A+.)] 9) = (j, (X — 8g). 


It therefore follows from (9.6.3), followed by Fatou’s Lemma [118, Lemma 1.7] 
applied to sums, that 


Tr C> liming ) (9;, [A +6)? +C7]!/? —(A+.8)] 9;) 
J 


> DT limint (yj, [[(A + 6)? + C7)” — (A +6) @)) 
Jj 


> Tr X. 


CHAPTER 10 


The Dirac Operator and the 
Brown—Ravenhall Model 


10.1. The Dirac Operator 


In the previous chapters we have seen that matter is stable of the second kind 
if the negative particles are fermions with a fixed, finite number of “spin states’ 
per particle. We found, however, that stability requires a bound on the nuclear 
charge Z and on the fine structure constant w. The non-relativistic ./a 0 - B(x) 
Pauli interaction of the electron spin with the magnetic field required a bound 
on Za? and on @, in addition to the compensating positive field energy [\B 7, 
Relativistic mechanics, on the other hand, in which |p + JaA(x)|? is replaced 
by |p + ./a@A(x)|, but without the Pauli term, required a bound on Za and 
ona. 

The next chapter in the stability saga ought to be the combination of magnetic 
interaction together with relativistic kinetic energy. Thus, we could consider 
replacing the Pauli kinetic energy operator |o - (p + ./a@A(x))|? simply by its 
square root |o -(p + ./a@A(x))| or, if we wish to include the electron’s mass, 
by /[o - (p + /a@A(x))]? + m2 — m. This was actually considered in the early 
days of quantum mechanics but was rejected for various reasons, one being that 


this is not a local operator! like the Laplacian A; moreover, it gives the wrong 
spectrum for hydrogen. 

The accepted relativistic generalization of Schrodinger’s kinetic energy oper- 
ator — A was invented by Dirac in 1928 [39]. Unlike | p| it is a local operator; in 
fact, it is a first order differential operator. For a particle of mass m it is given, 


' The notion of locality of an operator O means that the matrix elements (y, O @) vanish for any 
two functions w and ¢ of disjoint support. Hence any multiplication operator is local, including 
the Coulomb potential. The non-relativistic kinetic energy is local, too, but not the relativistic 
kinetic energy. 
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in our units, by 


Da =a- (p+ /JaA(x)) + Bm (10.1.1) 


where a = (a!, a’, a) and 6 are 4 x 4 matrices, satisfying the anticommutation 
relations 


dai +alo! =26;Ic, ai B+pai'=0, Bp? =I. (10.1.2) 
(We hope the reader will distinguish « and a; the notation predates us.) A 
particular representation, which we shall use in the following, is 


i 0 o! = Ic 0 
ae & : i; a= ( ; es (10.1.3) 


where a! are the Pauli matrices defined in Eq. (2.1.50). The Hilbert space for one 
particle has changed from L?(R?; C”) to L7(R*; C*). The operator p appearing 
in (10.1.1) is p = (p'" p, p*) = —iV, as before. 

The reason Dz, is like the square root of the Pauli operator 74 in (9.2.1) is 
easily seen by computing its square: 


gee en 0 , ) (10.1.4) 
0 [o - (p+ JaA(x)) | +m? 
The Dirac operator thus represents a sophisticated way of taking the square root 
of the Pauli operator. 
Note that D4, is not positive or even bounded from below. In fact, the spectrum 
of D4 and —D, are identical, since —D4 = UD,4U' with 


_(0 Ie 
u=(_4.. ): (10.1.5) 


In fact, under this unitary conjugation the a’ and 6 change sign. Since D> >m 
by (10.1.4), D4 has no spectrum in the interval (—m, m). 
The simplest case to consider is the one of just one particle without any 


2 


external potential, and without any magnetic field. The Hamiltonian is then Dp = 
a - p+ Bm and its ground state energy is —oo. To get around this problem, Dirac 
proposed to consider only the positive energy subspace of Do, interpreting the 
negative energy states to be ‘filled’.* That is, the one-particle space L7(R?; C*) 


2 This consideration led Dirac to the prediction of antiparticles as unfilled ‘holes’ in the otherwise 
filled subspace of negative energy states. These particles, called positrons, were discovered a 
few years later. 
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splits into a direct sum Hj ® Ho, where H; denote the positive and negative 
spectral subspaces of Do, respectively. On H,., Do is positive by definition, and 
hence there is no stability issue for Do alone. 

The following explicit representation of ices will be useful in the following. 
Since the projection onto Hg is given by (1+ Do/|Do|)/2, a function w in Te 
has a Fourier transform of the form 


= u(k) 
wk) = 2no0-k u k) (10.1.6) 
a/ (20k)?+m2+m 


with u € L?(R*; C7). Conversely, for any such u the function (10.1.6) is in ars F 
The norm and kinetic energy are given by 


(20k) 
Wwe / (u(k), wk) | 1+ . | dk 
R3 (Jerk? +m? + m) 
(10.1.7) 
(20k)? + m? 
,D =2 k), u(k dk. 
(Wr, DoW) fo 1 0) on 
Here, (-,-) denotes the inner product in Cc. 
In the special case m = 0, these formulas simplify to 
(HW) =2 | (uk), wk) dk 
R3 
(10.1.8) 


(YW, Dow) = 2 fu u(k)) |27k| dk. 


R3 


We display them explicitly as they will be useful later in Section 10.5. 


Up to now all the models in this book could be defined via quadratic forms, and it was not 
necessary to talk about self-adjoint operators and their spectra. In connection with the Dirac 
operator, however, the notion of spectral subspace is unavoidable, at least in the presence of 
magnetic fields. Without magnetic fields one can just use the explicit representation (10.1.6) 
for the space 71,7, and no spectral theory is required. With magnetic fields, as discussed below, 
no such explicit representation is available, however. For details on operator theory, we refer to 
standard textbooks, e.g., [150, Vol. 1]. 
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10.1.1 Gauge Invariance 


Like the Schrédinger operator (p + ./a@A (x))*, the Dirac operator D4 in (10.1.1) 
is gauge invariant. This means that for any function x there exists a unitary 
operator U(x) such that 


Dasvy = U(X)DaU(y)". 


In fact, U(x) is the multiplication operator e'¥’%) @ Ics. In particular, the 
spectrum of D4 does not change when the gradient of an arbitrary function is 
added to the vector potential A, but the spectral subspaces do change. Note that 
x does not affect the magnetic field B, since curl Vx = 0. 

One possible choice of gauge is div A(x) = 0. The following lemma shows 
that if the magnetic field energy is finite, one can always find an A such that 
curl A = B anddiv A = 0. In addition, A can be chosen to go to zero at infinity 
in a weak sense; more precisely A € L°(IR*). These questions were touched 
upon in Chapter 2, Section 2.1.2. 


Lemma 10.1 (Coulomb Gauge). Assume that the vector field B is in 
L?(R?; R3), ie., Emag(B) < 00, and assume that div B(x) = 0 (in the sense 
of distributions). Then there exists a unique A € L®(IR3; IR?) such that 


curl A(x) = B(x) and div A(x) =0 (10.1.9) 
(again, in the sense of distributions). 


Proof. If Bk) denotes the Fourier transform of B(x), we can take A(k) = 
(27i|k|?)'k A Bik). Note that k - Bk) = Osince div B = 0, whence (27k) A 
A(k) — Bik), which is the same as saying that curl A(x) = B(x). Moreover, 
k. A(k) = 0, meaning that div A(x) = 0. Hence |k|A(k) et. 

Sobolev’s inequality (2.2.4) will imply that A € L°(R*) if we can show that 
A goes to zero at infinity, in the weak sense that the measure of the set S,, = {x : 
|A(x)| > mw} is finite for every 1 > 0. Foranye > 0, Ff Ak) = A(k)O(\k| — €)is 
in L*(R3), where © is the Heaviside step function defined in (6.6.4); moreover, 
A(k)O(e — |k|) is in L'(R%) and its Fourier transform is uniformly bounded 
by Sii<e |A(k)|dk < Ce!/?||Bl|2, using Schwarz’s inequality. Choose ¢ small 
enough such that Ce!/?|| Bll. < w/2. Then Sy Cc {x : |f.(x)| => w/2}, which 
has finite measure since f, € L’. 

Finally, to show uniqueness, note that if both A; and Ap satisfy (10.1.9), 
their difference is curl-free and hence must be the gradient of a function x. 
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Since div A; — div Ay = div Vx = Ax is zero, x is a harmonic function. But 
the only harmonic function whose gradient goes to zero at infinity is the constant 
function, and hence A, = Ap. | 


The particular gauge given by (10.1.9) is called the Coulomb gauge. The 
condition that A goes to zero at infinity fixes A uniquely. This gauge plays 
a particular role in quantum electrodynamics, as we shall see in the next 
chapter. 

We denote the set of vector potentials in the Coulomb gauge with finite 
magnetic field energy by A, ie., 


A:={AeL*, divA=0,curlA € L’}. 


In the Coulomb gauge (10.1.9), the field energy (2.1.16) can alternatively be 
expressed as 


1 
Emag(B) = 3— / |VA(x)|"dx, 
R3 


where |V A(x)|* is short for yy |V A'(x)|*. This follows from the fact that 
|VA|? = |div A|? + |curl A|?. 


10.2 Three Alternative Hilbert Spaces 


The many-body Hamiltonian for relativistic electrons in a magnetic field is 
formally given by 


N 
H =~ Daj + aVce(X, R) + Emag(B) (10.2.1) 
i=l 
in analogy with the non-relativistic case. The sophisticated reader will notice 
that this theory is not really relativistic. To make such a theory one would 
have to assign a four-dimensional space-time coordinate to each electron just 
as we assign a three-dimensional coordinate in non-relativistic theory. While 
there have been many attempts to construct such a multiple time formalism no 
totally satisfactory model of this kind exists. The true solution, as it is currently 
understood, lies in the formalism of quantum field theory, in which only fields 
exist and particles are regarded as excitations of the field. Despite these concerns 
we shall continue to pursue the question of the stability of the Hamiltonian 
described by (10.2.1). 


186 Dirac Operator and the Brown—Ravenhall Model 


Although it is not truly relativistic, the Hamiltonian (10.2.1) is used in one 
form or another in practical calculations of energies of atoms and molecules, 
and remarkably accurate results have been attained with its use. It can not be 
taken literally without some modification, however, for reasons we shall now 
explain. 

Since H in (10.2.1) is not bounded from below, its ground state energy 
(defined as in the non-relativistic case as the infimum of (wv, Hy)/(w, w) over 
all w 4 0) is not a useful concept. For N = 1, i.e., the hydrogenic atom, the 
spectrum of H consists of discrete eigenvalues in the interval (—m, m) together 
with continuous spectrum in its complement (—oo, m] U [m, oo). These discrete 
eigenvalues can be interpreted as electron bound states; we shall discuss this 
further in Section 10.3 below. For N > 2, however, the spectrum of H consists 
of the whole real line without gaps, and eigenvalues would not normally be 
expected to exist. This was observed by Brown and Ravenhall in [23, 16], and 
results from particles in the positive energy subspace of the Dirac operator 
interacting with particles in the negative energy subspace. 

There are several alternative ways of addressing the question of stability, three 
of which we shall discuss here. 


10.2.1 The Brown—Ravenhall Model 


A solution of the problem addressed above, proposed by Brown and Ravenhall 
[23] on the basis of perturbation theoretic considerations, is to allow the elec- 
tron wave functions to live only in the positive energy subspace of the Dirac 
operator Do. This is in accordance with Dirac’s original suggestion that the 
negative energy states are completely filled with fermions, and hence, by the 
Pauli principle, only the positive energy states are available to the electrons." If 
Aj denotes the projection onto the positive spectral subspace of Do, the ground 


4 The Brown-Ravenhall model is also sometimes called the ‘no pair’ model for the following 
reason. A more accurate model would have additional terms in the dynamics that cause electrons 
in the filled negative energy sea to move to the positive energy subspace. The resulting ‘hole’ 
in the sea is an electron ‘antiparticle’ called a ‘positron’ and this is identified, physically, as a 
particle. Thus, the missing terms ‘create’ an electron—positron pair, but the creation of a pair, 
even if it were included in the model, would require the expenditure of a relatively large energy 
(2mc’). The absence of such pair production effects is a defect of the model, but it is regarded 
as a minor defect in the world of quantum chemistry, where 2mc? is very much larger than other 
energies under consideration. 
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state energy in the Brown—Ravenhall model is thus 


N 


Ey = inf \(v, Hy): we A\[ASL>R’: C4], Iivil2=1, Ae AP. 


Here, we choose to minimize over vector potentials in the Coulomb gauge 
div A = 0. Although this may seem arbitrary, it is important for the following 
reasons. The Brown—Ravenhall model is not gauge-invariant, since a gauge 
change can not be incorporated into the wave function in the usual way by 
the replacement y(x) b> e! vax (x), for this would generally take the wave 
function out of the space a L?(R?; C*). Therefore, the energy does depend on 
the choice of gauge in this model. The condition div A = 0 (together with the 
restriction that A vanishes weakly at infinity) seems natural here, since without 
it Eg would simply be —oo. In fact, for any y the energy could be driven to —oo 
by adding AV x to A for a suitable x such that (W,a-Vyxw) £0 and letting 
A> Ow. 

Even with the Coulomb gauge condition, it is not obvious that the Brown— 
Ravenhall model is stable. Indeed it is not — at least not for large enough N 
regardless of how small a@ might be. This will be discussed in Section 10.5 
below. It turns out that the Coulomb potential Vc is irrelevant for the instability! 


10.2.2 A Modified Brown—Ravenhall Model 


In order to define a gauge-invariant model and at the same time ensure that the 
kinetic energy of the electrons is always positive, one can define the relevant 
electron subspace to be the positive energy subspace of D4, not Do. That 
is, the Hilbert space now depends on the choice of the vector potential. The 
resulting model is now gauge invariant. Because of this gauge invariance, it is 
not necessary to work in the Coulomb gauge. 

For this modified Brown—Ravenhall model, the ground state energy is 


N 
Ey = ot fom Hw):we /\ [AQZL7(R*; C4], Ile = 1, / |curl Al? < ~| 


Here, Ay denote the projection onto the positive energy subspace of Da, i.e., 
Leh = (D4 + |Da|)/(2|Da|). The infimum is taken over wave functions yw and 
vector potentials A. 
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The reader might object to the fact that the electron’s Hilbert space depends 
on the magnetic vector potential A, a situation that we have not encountered 
before. This connection between A and the state of the electron is not unnatural, 
however, if we interpret the field A as one that is generated by the electrons 
themselves. From this perspective, the elementary object is not a bare electron 
but an electron dressed with its magnetic field, which in some ways is more 
natural. We will encounter this situation again in the next chapter on quantized 
electromagnetic fields. 

It turns out that the modified Brown—Ravenhall model is stable provided a 
and Za are both not too large. This will be proved in Section 10.4. 


10.2.3 The Furry Picture 


It is possible to go one step further and include not only the magnetic vector 
potential in the definition of the relevant electron space but also the one-particle 
part of the Coulomb potential Vc, i.e., the interaction energy between the 
electron and the nuclei. That is, the electron Hilbert space is now the positive 
energy subspace of 


Z; 
yy (10.2.2) 


This is sometimes referred to as the Furry picture (after W. Furry [75]). This 
choice trivially leads to stability, since there are no negative terms left in the 
energy. If the operator (10.2.2) has eigenvalues < 0, one may want to include 
the corresponding L? eigenfunctions in the electron space. 

The Furry picture is often employed in calculations, and there is some justi- 
fication for it [142, 176]. The Hilbert space now depends on the positions of the 
nuclei, however, but these are usually regarded as macroscopically determined 
parameters in the problem and should not, therefore, influence the fundamen- 
tal Hilbert space of the microscopic particles in the system. Since stability is 
automatically ensured in this model, we have no need to discuss it further.° 


> The stability problem has not completely disappeared, however, since one still has to be able to 
define the operator (10.2.2). To do so one needs that Z;a < 1 if all the R; are distinct. Compare 
with footnote 6 on page 191. 
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10.3. The One-Particle Problem 


Before discussing the general many-body problem, let us first investigate the 
case of one electron in the three different pictures described in the previous 
section. We start with considering a single particle in a magnetic field, and later 
discuss the hydrogenic atom. 


10.3.1 The Lonely Dirac Particle in a Magnetic Field 


Consider a single electron in the presence of a magnetic field with vector potential 
A. In the Brown—Ravenhall model discussed above, the question of stability 
already arises in this simple system. In fact, we shall show that the question of 
stability depends on the value of a. 

Recall that ae denotes the positive spectral subspace of Dp = a - p + Bm. 


Lemma 10.2 (Stability Criterion for a Single Particle). There is a finite 
critical value of a, denoted by a,, with 31 2-4/3 < a. < 00, such that 


inf ((/, DaW) + Emag(B) : WE HG. lIWll2 = 1, A € A} 
is finite fora < a. and —& fora > Qe. 


Proof. The fact that a, < oo follows from scaling. We cannot, in fact, scale w 
directly since this would take us out of Ha (which depends on m in a non-trivial 
way). The terms p, A and €pag(B) all behave like an inverse length under scaling; 
only m is scale independent. We can, however, use the representation (10.1.6) 
for w € Hg and scale the corresponding u by A~7/?u(p/A). For any fixed u, 
pick an A such that (y, « - Ay) <0 for all small m. Rescale u according to 
u(p) > A~7/2u(p/a), A(x) > AA(Ax), and let y be given by (10.1.6). Then, 
for small enough A, (Ww, Daw) + Emag(B) will be negative for a large enough. 
The energy can then be driven to —oo by letting 4 tend to +00. Note that in 
the limit of large 4, |p| becomes large and hence the m in (10.1.6) becomes 
irrelevant. 
To show that a, > 0, we first note that for w € He F 


(Wy, Dow) = f VOrkF Fm Pdo/"ak. (10.3.1) 
R3 
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where lWik)| denotes the Euclidean C* norm of the C* vector Wk). This can 
be seen by simply noting that Do = Aj |Dol Ag — Ag |Do|Ag. with Ag the 
projections onto 715, and using the fact that |Do|* = p? + m?, see Eq. (10.1.4). 
Using (10.3.1) and Sobolev’s inequality (2.2.12) we thus obtain 

2/3 


(W, Dow) = 5! / woepax ) 
R3 


with S’ = (277)'/3. Moreover, since « - A(x) > —|A(x)|, 


(y,a- A(x)y) > — i |A(x)Il be) Pdx > —lAllellvlli2/5 = —lAllelv Iisilv lla, 
R3 
where we have used Holder’s inequality in the last two steps. Finally, again using 
the Sobolev inequality, but this time Eq. (2.2.4) for | p|? instead of Eq. (2.2.12) 
for |p|, 
1/3 
Eme(B) = = f IAGO Pde > 3 (F) AIR 
R3 
Using || w||2 = 1, the energy is then bounded from below by 


3 1/3 
(Wr, DaW) + Emag(B) = 2”)! IhWI3 — VallAllolvls +5 (2) WAL, 


which is positive for all y and A as long as @ < 3274/3, | 


In the modified Brown—Ravenhall model, where the electron Hilbert space is 
the positive spectral subspace of D4, instead of Do, stability of a single particle 
is not an issue, of course. 


10.3.2 The Hydrogenic Atom in a Magnetic Field 


Consider one Dirac particle in the Coulomb field of a nucleus of charge Z. 
For simplicity, we set A = 0 for the moment. The appropriate Hamiltonian 
describing this system is 


Za 
H = Do - —. 
|x| 
There are two ways to proceed. One is to try to find the spectrum of H on 
the full Hilbert space L7(IR*;C*). This can actually be done explicitly, see, 
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e.g., [16]. As long as Za < 1, one finds that there are discrete eigenvalues in the 
interval [0, m), and these look very much like the spectrum of the non-relativistic 
hydrogen problem with Hamiltonian —(1/2)A — Za|x|~!, but shifted upward 
by m. In addition, there is continuous spectrum on (—oo, m] U [m, 00).° As 
long as Za < 1, there is no need to make an a priori restriction of the allowed 
function space to the positive spectral subspace of Do. This is the approach 
usually taken in the analysis of one-electron atoms. It corresponds to the Furry 
picture discussed above. It has the conceptual disadvantage that the relevant 
Hilbert space changes when someone moves a nucleus (although it has to be 
admitted that many people do not regard this as a problem). The main point here 
is that there is no simple variational (i.e., energy minimization) principle for H. 
Even for the simple hydrogen atom with Za < | the ground state energy can 
not be found by minimizing (wv, H y) but only by an exact calculation of the 
spectrum of H./ 

An alternative approach is to consider the Brown—Ravenhall model, where 
one restricts the allowed wave functions to the positive energy subspace of Do. 
In this way, one has a variational principle for the ground state energy. Although 
the spectrum of hydrogenic atoms does not come out quite as accurate as in the 
first approach, the latter is much more suitable for an analysis of the stability of 
matter. As a warm-up, let us apply this approach to hydrogenic atoms. That is, 
we are looking for 


Ey = inf [(W, Dow) — Za(y, |x|"'y)]. (10.3.2) 
wen, Iv ll2=1 


Lemma 10.3 (Stability of Hydrogen). There is a critical (Za),, with 2/m < 


(Za). < 4/m, such that Eo in (10.3.2) satisfies Ey > —co for Za < (Za), and 
Eo = —o for Za > (Za),. 


The value of the critical Za can actually be computed explicitly. It is given 
by (Za), = 2/(7/2 + 2/7). Note that this number is bigger than 2/z but less 


As long as Za < 1, A can be defined as a self-adjoint operator on a natural domain containing 
all infinitely differentiable functions of compact support, as was recently proved by Esteban 
and Loss [53]. Self-adjointness fails for Za > 1. The existence of a critical value of Za for 
the Dirac Hamiltonian is analogous to the case of | p| discussed in Chapter 8 where the critical 
value of Za was 2/z. 

In the one-particle case, there are min-max principles for the eigenvalues of the Dirac operator 
in the interval (—m, m). See [80, 52]. 
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than 1. The computation of (Za), was a challenging problem which was solved 
by Evans, Perry and Siedentop in [54]. Improved bounds on the energy were 
later given in [181], and a generalization to the case of one-electron molecules 
is in [5]. 


Proof. The fact that Eo is bounded from below as long as Za < 2/z follows 
immediately from Lemma 8.2 in Chapter 8, together with the observation in 
Eq. (10.3.1) that Do acts as multiplication by (277k)? + m? on the Fourier 
transform of functions in Hj. 

Consider now the case Za > 4/z. For any E <0, we can find a function 
wv € L?(R?;C*) with ||v|/2 = 1 such that 


Za 4 
(w, |Dolw) — a Ix| p) < E. 


This follows again from Lemma 8.2 in Chapter 8, where it was shown that 2/7 
is the critical coupling constant for stability of |p| — Za|x|~'. Let ws. denote 
the projection of w onto 7H). We note that ||W||2 < 1. Then (yw, |Do|w) = 
(w+, |Dolwi) + (W_, |Dolw_) and, by Schwarz’s inequality, 


(Ww, |x| Ww) < 204, lel Wy) + 20, [x W). 
We thus conclude that 
(W4, [Dol — Zolx|~'W4) + (WE, |Dol — Zalx|"'W_) < E. 


One of the terms on the left side has to be less than E£/2. Without loss of 
generality, we can assume it is the first term; if not, simply replace yw by Uw 
with U given in (10.1.5) in order to exchange y, and w_. Since EF was arbitrary, 
this shows instability for Za > 4/r7:. | 


In combination, the results of this and the previous subsection imply the 
following lemma concerning hydrogenic atoms in magnetic fields. 


Lemma 10.4 (Hydrogen in a Magnetic Field). For both a and Za small 

enough, 

inf {(Y, Dat) — Zar, lel 'W) + Emag(B) + € HG, MWe = 1, A € A] 
(10.3.3) 


is finite. If either a or Za is too large, it is —. 


We shall not trouble the reader at this point with precise estimates on the 
critical parameters for the single electron but will turn our attention to the 
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many-body problem instead. It will turn out that the Brown—Ravenhall model, 
as defined using the positive spectral subspace of Do, is inherently unstable of 
the first kind! This result is not unexpected since there is no positivity condition 
available to control the N terms a a ; - A(x ;). Historically, the instability was 
first proved in [127] with the aid of the negative potential generated by M nuclei, 
with M >> 1. It was later realized by Griesemer and Tix [81] that the nuclei were 
not needed! The @ ; - A(x ;) terms alone can cause collapse if N is large enough. 
This analysis will be summarized in Section 10.5. 

The modified Brown—Ravenhall model, where the electron Hilbert space is 
defined as the positive spectral subspace of Da, turns out to be stable for a and 
Za small enough. We shall show this in the next section. 


10.4 Stability of the Modified Brown—Ravenhall Model 


The Hamiltonian for N electrons is 


N 
H => Daj +aVe(X, R) + Emag(B), (10.4.1) 


i=1 


with D4 defined in (10.1.1) and Vc the total Coulomb potential given in (2.1.21). 
Let aS denote the projection onto the positive spectral subspace of D4, and let 
Hi = A{L?(R?; C*) denote the corresponding subspace. For N electrons, the 
relevant Hilbert space is now 


N 
Ha (iG 
i=1 
and the ground state energy is 
Eo = int Ge Hy): wed, |lvll2=1, A € A}. (10.4.2) 


As in the previous chapter, we could allow for spin degrees of freedom in 
addition to the 4 naturally present in the one-particle space L*(R*; C*), but we 
shall refrain from doing so for simplicity. The choice of the Coulomb gauge, 
i.e., div A = 0, in A is not important because of gauge invariance, which holds 
in this model but which does not hold if one considers the positive spectral 
subspace of Do as the one-particle subspace, instead of the one of Da, as we do 
here. 
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The following theorem was proved by Lieb, Siedentop and Solovej in [127]. 


Theorem 10.1 (Stability of the Modified Brown—Ravenhall Model). Let 
K(q, Z) be given by 
1 Zu 
=—— $3,969 P77 + 1.039", 10.4.3 
(oD 2 q q ( ) 
Then Epo in (10.4.2) is non-negative for all N and M and nuclear positions 
R,,..., Ry as long asa < x(2, Z) and 


a? 82 1 
NaTalt—-—==}  2=: 10.4.4 
. ( K(2, =) = 85 eet 


For the physical value of a ~ 1/137, this proves stability for all Z < 56. 
For Z = 1, on the other hand, stability holds for all a < 0.117. It is still an 
open problem to show that stability holds up to the expected critical value 
Z = 2a7!/(2/m + 2/2) © 124 for the physical value of a. A proof that stability 
holds up to Z = 88 is given in [91]. 

The quantity «(q, Z) in (10.4.3) is the same as in (9.5.2) in the previous chapter 
and it is the lower bound on the critical value of a for stability of relativistic 
matter obtained in Chapter 8, Theorem 8.1. As in the previous chapter on the 
Pauli operator, that theorem will play a crucial role in the proof of Theorem 10.1. 

The following proof follows [127] closely. 


Proof. As in the proof of Theorem 9.2 in the previous Chapter 9, the first step 
is to use the relativistic stability of matter, which states that 


N 
V(X, R= -— 1p, + VeACr) 
i=1 

for all « < x(q, Z) defined in (10.4.3). A priori, one might think that g = 4 
here, but it is actually enough to use g = 2. This can be seen by the following 
argument, which is due to M. Loss and is given in [127, Appendix B]. It turns 
out that the spin-summed one-particle density matrix y" of any y € H, defined 
in (3.1.32), is bounded by 2 and not only by 4 as one might naively think. In fact, 
since the unitary U in (10.1.5) maps the positive spectral subspace of D4 to the 
negative spectral subspace (and vice versa), the one-particle density matrix y“) 
of w satisfies yp“) + Uy ut < I, and hence 


1 
yO = Trey = sires (y® 4 Uyut) <2. 
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The only place g enters in the proof of Theorem 8.1 on relativistic stability is 
via the kinetic energy inequality in Corollary 4.1, which depends on the largest 
eigenvalue of the spin-summed one-particle density matrix y. Since this largest 
eigenvalue is bounded by 2 in the present case, the effective g is thus 2 and 
not 4. 

In this way, our stability problem reduces to finding a bound on the sum of 
the negative eigenvalues of the one-particle operator 


h= N4 (Da - |p + VeAl) AX = AXSAG, 
K 
where we denote 
a 
S = |Dal - we + JaAl. 


Like any self-adjoint operator, S can be decomposed into its positive and negative 
parts. That is, it can be written as S = S, — S_, with Sx positive operators, and 
S,S_ = S_S; = 0. If we neglect the positive part S,, the sum of the negative 
eigenvalues of AGS AG, will only decrease, and hence 


Tih. =< TrA, SA). 


Note that USUt = S, with U given in (10.1.5). Hence also US_Ut=S_. 
Moreover, since UAtUI =A, =I- A, we have 


1 
TA, SA = 5 irs. 
The next step is to use the BKS inequality, Theorem 9.4. It implies that 
a a 1/2 
Tr S_=Tr [Pal — —|P + JaAl| <Tr [iar - IP + vaar| 
Recall that |Da|? =[o-(p+/a@A)?? +m? =(p+ J/aAy+/J/ao-Bt+ 


m?. For an upper bound on the square root of the negative eigenvalues, we 
can use m? > 0 as well as o - B(x) > —|B(x)|. Hence 


ve: 1/2 
Tr [iar aera vaar| 


a _ 1/2 
= Tr (: = =<) Ip + /@Al? — vaiBtx)| 
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The LT inequality, Theorem 4.1 for y = 1/2, implies that the latter trace is 
bounded from above by 


2 1/2 
Tr (: : <) pte vais) 


@2\ 73/2 
< 4a (1 - “) Lipa | |B(x)|7dx. 
K 
R3 


(The factor 4 comes from the C* part of the trace.) 
Recall that we still have the positive magnetic field energy at our disposal, 
namely (877)! dia |B(x)|?dx. Non-negativity of the ground state energy Ey thus 


holds as long as 
es L an 
(04 =—e 
2 1/23 S 8x 


As explained in Chapter 4, L1/2,3 < 0.0683. Inserting this bound for L /2,3 yields 
(10.4.4). | 


10.5 Instability of the Original Brown—Ravenhall Model 


In the previous section, we showed that the modified (gauge-invariant) Brown— 
Ravenhall model is indeed stable of the second kind, if @ is sufficiently small 
(independent of NV). The original Brown—Ravenhall model, in contrast, is not 
even stable of the first kind, as we shall see in this section. 

As in the previous examples of instability, it was originally thought that the 
negative part of the Coulomb interaction energy was necessary for instability 
of the first kind [127]. Later it was realized by Griesemer and Tix [81] that the 
nuclei were not needed. The instability is solely caused by the interaction of the 
electrons with the magnetic field, that is, by the w - A term in the Dirac operator. 
What they showed is the following. 


Theorem 10.2 (Instability of the First Kind for the Original Brown—Raven- 
hall Model). There is a constant C > 0, independent of m, N and a, such 
that 


N 
mt { (vd 2, Pai 1) Ea >we AH, Iwil=1, 464 


i=1 


as long as Na?!” > C. 
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In particular, the original Brown—Ravenhall model is unstable for any a > 0 
and WN large enough. This is true also with the inclusion of the Coulomb energy 
Vc, but we shall not consider this case here. The following proof of Theorem 10.2 
is inspired by [81]. We shall be brief at some points and resist the temptation of 
giving an estimate of the constant C. It was shown in [81] that C < 1.4 x 10°. 


Proof. The first point to note is that we can set m = 0. As discussed in the proof 
of Lemma 10.2, it suffices to find a y € a Fy (form =0) and an AeA 
such that (y, pe Da) + Emag(B) < 0. The energy (with the m) can then be 
driven to —oo by scaling, since all the terms in the energy (except for m) scale 
like an inverse length. 

The strategy now is to find N orthonormal functions in 3 such that the 
kinetic energy of the corresponding Slater determinant (defined in Chapter 3, 
Section 3.1.3) has a kinetic energy proportional to N*/?, which is what would 
be expected from semiclassics of relativistic electrons in a box of size 1 (see 
Chapter 4, Section 4.1.1). Moreover, one tries to find a fixed A (independent of 
N) so that the expected value of a a; - A(x ;) is order N. The field energy is 
then fixed, independent of NV. If we multiply the chosen vector potentials A by 
i, the total energy is thus of the order N*/? — const A./a@N + const A?. With the 
choice A = c,/aN for an appropriate c > 0, this becomes N*/? — constaN?, 
which is negative for w*/?N large enough, as claimed. 

A possible choice is the following. Consider one-particle wave functions in 


Te of the form 
u 
Wy) = oP y ’ 
|P| 


where u(x, 0) = f(x)do,1. We choose N orthonormal functions f, € L?(R?) to 
be the N lowest eigenfunctions of the Dirichlet Laplacian in a cubic box of length 
1, multiplied by a factor e'”* for some w € R*. We assume that || f,,||? = 1/2, 
so that the corresponding #’s are normalized to 1. The kinetic energy of the 
Slater-determinant made up of these N functions is of the order NV 473 4. larlN. 

The vector potential A we choose is one that is constant (in the z-direction) 
inside the unit cube, and appropriately defined outside the unit cube so as to 
make it divergence free and have finite magnetic field energy.* The contribution 


8 One way to imagine such a field is to consider a finite circular pipe though which water is 
flowing, in such a way that the velocity field close to the portion near the central axis has a 
constant magnitude and then decays smoothly to zero towards the boundary of the pipe. The 
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of 7, @; - A(x;) is then 


N — 3 
20> | fico? Pap (10.5.1) 


n=l p3 


where a denotes the magnitude of A inside the unit cube. The expression 
ry See | fi ( p)|* is just the momentum distribution of the system which, for 
large N, is approximately given by o((p — w)/N'/?) for some fixed @ with 
{ o(p)dp = 1. If we choose |@|, which is at our disposal, to be equal to a large 
number times N!/?, (10.5.1) is of the order aN, as desired. This concludes the 
implementation of our strategy outlined above, and thus the proof. a 


10.6 The Non-Relativistic Limit and the Pauli Operator 


The Hamiltonian with the Pauli kinetic energy considered in Chapter 9 is said 
to be the non-relativistic limit of the models considered in this chapter. ‘Non- 
relativistic limit’ in our units means that we let m — oo with fixed a. We shall 
consider this limit in the three Hilbert spaces discussed in Section 10.2. 


e The Furry Picture. For one particle in an external potential V, the Furry 
picture simply means that we are looking at the positive spectrum of the full 
Dirac operator D4 + V. We look for a solution of the Schrodinger equation 
(Dat+V—E)W =0, with w € L?(R;C’). If we write 


_ (ue) 
W(x) = tS (10.6.1) 


for u, £ € L?(R?; C”), this equation becomes 
o-(p+J/aA)e+ Vu =(E—m)u 
o-(p+JSaAu+VE=(E+mye. (10.6.2) 


For energies E ~ m + O(1) (for hydrogen, EF ~ m(1 — 10->)), we can ignore 
the term V2 in the second equation for large m, and thereby obtain ¢ ~ 


velocity field of the water will be our A field. Now we bend the pipe into a torus, but keeping 
part of it unbent. If the unit cube is inside the unbent part, the velocity field inside it will be 
uniform, as desired. 
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o -(p+ /aA)u/(2m). Inserting this into the first equation yields 


1 
5, lo (P+ Ja A)} u+ Vu =(E—m)u, (10.6.3) 


which is the Schrodinger equation with the Pauli kinetic energy. For a precise 
mathematical analysis in terms of convergence of the operators in resolvent 
sense (and also higher order corrections) we refer to Thaller’s treatise [177, 
Chapter 6]. 

The Original Brown—Ravenhall Model. For large m, the lower component 
of the wave function in the positive spectral subspace of Do is related to 
the upper one simply by €*o - pu/(2m), see Eq. (10.1.6). For functions 
in the positive spectral subspace of Do, the Dirac operator Do simply acts 
as \/p? + m2 = m+ p*/(2m). Hence (W, Dow) © (u, m + p*/(2m) u). The 
expectation value of w - A in (10.6.1) equals 


1 
Fm Ue Ke - Pe - A) + - Ao - p)]u) 
m 


in this case. A simple computation shows that (o - p)(o -A)+(o- A) 
(o-p)=p-A+A-p-+o- B. Hence we recover all the terms in the Pauli 
operator, except for aA?/(2m). 

The conclusion is that we recover the Pauli—Fierz model (discussed in the 
footnote on page 21) as the non-relativistic limit of the Brown—Ravenhall 
model, and not the true (gauge-invariant) Pauli kinetic energy. This defect is 
closely related to the instability of the Brown—Ravenhall model discussed in 
this chapter. 

The Modified Brown—Ravenhall Model. On the positive spectral subspace 
of Da, the Pauli operator D4 acts as |D4| = /[o -(p + /a@A)]? +m? & 
m+ [o-(p+/aA)]?/(2m). Hence we immediately recover the full Pauli 
operator (10.6.3) in the large m limit of the modified model, thus remedying 


the defect just mentioned. 


CHAPTER 11 


Quantized Electromagnetic Fields and 
Stability of Matter 


So far we have treated the magnetic field B as a (classical) external field, with 
an energy Emag(B) = (8x)7! f | B(x)|°?dx. We would now like to broaden our 
horizon and treat the electromagnetic field in its proper role as a dynamical 
quantity. Moreover, we would like to go even further and quantize this field in 
the canonical way, which will automatically give us the quantization of light, as 
discovered by Planck in 1900 [149]. The Hilbert space for this revised quantum 
mechanics will be larger than the L* spaces considered before, and will involve 
Fock space. The stability of matter questions will become more complicated in 
some ways and will have to be revisited. 

The ultimate version of quantum electrodynamics is even more complicated. 
The number of electrons is not fixed, but fluctuates. Electrons are defined by 
a quantized field and can be created and destroyed. We shall not venture into 
this realm, however, and will continue to regard the electron number N as 
fixed. 

We begin with a review of electromagnetism, which the knowledgeable 
reader can safely skip. A good reference for further details is Spohn’s treatise 
[174]. 


11.1 Review of Classical Electrodynamics and its Quantization 
11.1.1 Maxwell’s Equations 


In classical electromagnetic theory, the interaction of charged particles is medi- 
ated by fields, which is to say (possibly time-dependent) functions on R?. There 
are six in fact, consisting of two (three component) vector fields E(x) and B(x), 
the electric field and the magnetic field. These satisfy the following equations, 
known as Maxwell’s equations. With the notation df (x, t)/dt = f (x,t), they 


200 


11.1 Classical Electrodynamics and its Quantization 201 


div E = 470 culB =47j+E (11.1.1) 


divB =0 curl E = —B. (11.1.2) 


The two equations (11.1.1) are the inhomogeneous Maxwell equations and the 
two equations (11.1.2) are the homogeneous Maxwell equations. The function 
@ is the charge density and the vector-valued function j is the electric current 
density. The two are related by the continuity equation 


6 +div j =0, (11.1.3) 


which expresses the conservation of charge, that is, the fact that the total charge 
is independent of time. If (11.1.3) did not hold, Maxwell’s equation would be 
inconsistent because 477@ = div E = —4z div j, since divcurl B = 0. In our 
case, the charges are point charges of electrons and nuclei, so @ is a sum of 4 
functions, 


oy) =>) Qid(y—x;) and j(y)= >> Oit8(y—x;), (11.1.4) 


where Q; denotes the charge of particle 7. Maxwell’s equations have to be 
understood in the distributional sense (see [118]). 

The fields FE and B are generated by the moving charges x;(t), according to 
(11.1.1). In turn, they yield the electromagnetic forces on the particles, which 
are 


F, = Q; (Ei, t) +x; A BX), 1). (11.1.5) 


(The force on the particles is really ill-defined because of the self-interaction of 
the particles, i.e., the interacting of a particle with the field it creates. One can 
get around this difficulty by imagining the point charge to be smeared out a bit, 
but we shall ignore this problem for the moment.) 

The homogeneous Maxwell equations (11.1.2) are constraints on the possible 
values of the fields which imply that the fields can be described economically 
by introducing potentials. More precisely, for a scalar (real-valued) potential 
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(x, t) and a vector potential A(x, t), we can define 


E(x, t) = —Vo(x, t) -— A(x, t), B(x, t) = curl A(x, fr), 


(11.1.6) 


and observe that Eqs. (11.1.2) are automatically satisfied. All fields E and B 
satisfying (11.1.2) can be written in this way. This follows from the fact that any 
divergence-free vector field can be written as a curl, and every curl-free vector 
field as a gradient. The equations (11.1.1) become 


—A@ — div A = 420 (11.1.7) 
and 
VdivA—AA+A =4nj — Vo. (11.1.8) 


Note that E and B do not determine A and ¢ uniquely. In fact, there is a huge 
freedom in choosing the potentials. Namely, if A and ¢ are potentials giving rise 
to fields E and B via (11.1.6), so do 


A(x, t)=A(x,))-Vxx,0, (x. 0=o(%,0+%x,1) (11.1.9) 


for an arbitrary function x. One can thus impose restrictions on the potentials in 
order to remove this ambiguity. Such restrictions are called gauge conditions. 

Maxwell’s equations (11.1.7) and (11.1.8) simplify considerably in the 
Coulomb gauge where div A = 0 for all times (compare with Section 10.1.1). 
There is always a choice of A satisfying this gauge condition. Namely, for gen- 
eral A and ¢ define x by Ay =div A. Then obviously A’ and @¢’ in (11.1.9) 
give rise to the same fields E and B, and div A’ = 0 for all times. 

In Coulomb gauge, the equations (11.1.7) and (11.1.8) become 


—Agd = 470 (11.1.10) 
and 
—~AA+A=4nj —V¢. (1.1.11) 


The scalar potential ¢ is thus given by the instantaneous Coulomb potential 


way f PPP gy, (11.1.12) 
, |x — y| 
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Using the continuity equation (11.1.3), we see that 
4nj —Vb=4n Pj (11.1.13) 


where P denotes the projection onto divergence-free vector fields. Explicitly, 


1 div j(y,t 
(Pen = jens cv [SO Pay (1.1.14) 
An |x — y| 
R3 
or, in momentum space, 
. 3 kik! 

Pj)'(k,t) = Sa = | PR a: 11.1.15 
(P7)'(k, t) > (a a) i ) ( ) 


f=] 


To summarize, in Coulomb gauge the scalar potential ¢ is explicitly given 
by (11.1.12), and the vector potential A satisfies the equation 


~AA+A=4n Pj. (11.1.16) 


The fields E and B are given in terms of these potentials by (11.1.6). 

Equation (11.1.16) can be interpreted as the evolution equation of coupled 
harmonic oscillators, at each point x, driven by an external force. These harmonic 
oscillators are decoupled in Fourier space, where (11.1.16) becomes 


A(k, t) + |20k|2A(k, t) = 4 Pj(k, 1). (1.1.17) 


For every k, this represents an independent harmonic oscillator with frequency 
|kK| (or angular frequency 27:|k|). Later on we shall quantize these harmonic 
oscillators in the canonical way thus leading to Planck’s 1900 law [149], which 
equates the energy of a photon of frequency 27 |k| with the excitation energy of 
the corresponding harmonic oscillator. Note that the Coulomb gauge condition 
becomes k- A(k, t) = 0 in momentum space. That is, A is perpendicular to the 
direction of k. In particular, for each k there are only 2 independent degrees of 
freedom, transverse to the direction of propagation k. 
The electromagnetic field energy equals 


ae f (Bee? + Ben?) dx. (11.1.18) 
R3 
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This is the amount of work against the forces (11.1.5) to take well separated 
massless infinitesimal charges at rest and bring them into the stated conditions of 
position and velocity at time r. Using (11.1.6) and the Coulomb gauge div A = 0, 
this energy, (11.1.18), becomes 


1 
= / (A(x, t)? + (curl A(x, t))’) dx + D(o, @) 
R3 
1 A oa 
= | (Ak, 17 + |2rk|? A(k, 1) dk + D(o,e@), (11.1.19) 
a 
R3 
where D(o, Q) is the self-energy of the charge distribution 0, as defined in 
(5.1.4). The mixed term { V@ - A disappeared by integrating by parts and using 
the Coulomb gauge condition. If we want to interpret (1 1.1.17) as the equation 
of motion of a harmonic oscillator of angular frequency 27|k| and energy 
given by the integrand in (11.1.19), then we have to identify (4)! A(k, t) 
as the ‘momentum’ corresponding to the ‘position’ coordinate A(k, t).! The 
electromagnetic field energy can thus be thought of as the energy of independent 
harmonic oscillators, two for every momentum k corresponding to the two 
transverse degrees of freedom. 


11.1.2. Lagrangian and Hamiltonian of the Electromagnetic Field 


The analogy with the harmonic oscillators suggests a way to derive a Hamiltonian 
description of the electromagnetic field, and ultimately for the field plus the 
matter. To do this systematically, we follow the standard route of first starting 
with the Lagrangian point of view. With E and B given as in (11.1.6), the 
Lagrangian is 


cid, a, A= f | = (E(x, t) — B(x, t)’) 


R3 


+ j(x,t)- A(x, t)- atx orn dx. (11.1.20) 


' This is not to be confused with the momentum of the electromagnetic field, which is proportional 
to Poynting’s vector E A B. The words ‘momentum’ and ‘position’ are used here merely to 
make an analogy with an oscillating spring. 
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It can be readily checked that the Euler-Lagrange equations for this Lagrangian 
yield (11.1.1). While £ involves the time derivative of one of the variables, A, 
as it should, it does not involve involve the time derivative of the other variable, 
@. Consequently the Euler-Lagrange equation for @ is really just a constraint 
equation, given by 


—div (A+ V¢) = 470, (11.1.21) 


which is the same as (11.1.7). 

As already mentioned in the previous subsection, there is still some ambiguity 
in the choice of A and, at this point, it is convenient to work in the Coulomb 
gauge div A = 0. In particular, div A = 0, and hence Eq. (11.1.21) becomes 
Ag = —4zsq. The scalar potential ¢ is thus given by the instantaneous Coulomb 
potential (1 1.1.12). Eliminating ¢@ from the Lagrangian (1 1.1.20) with the aid of 
(11.1.12) and again using div A = 0, the Lagrangian becomes 


L(A, A) = / E (A(x, 1)? — B(x, t)?) + f(x, t)- A(x, | dx — D(o, Q). 


R3 
C1149) 


The last expression is just the (instantaneous) electrostatic energy of the charge 
distribution g. In case g is the sum of 6 functions, as in (11.1.4), this energy is 
really infinite owing to the infinite self-energy of a point charge. After subtracting 
this infinite self-energy, the electrostatic energy becomes 


Ds 219i (11.1.23) 

ae |x; = x j| 

i<] 

The scalar potential @ has thus been eliminated from the problem, and A is 

the only dynamical variable. The canonically conjugate momentum to A is 
_b£ it 
6A 4x 

The Hamiltonian corresponding to the Lagrangian £ reads 
1 

H(A, I) = / an I(x, t) + ap BO ty —j(x,t)- Ar, | dx + D(o, @). 
us 

R 
(11.1.24) 


In particular, we recover the magnetic field energy E€mag(B) = (87)! 
ie B(x)?dx. The reason that the additional IT” term has not appeared so far is 
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that we were always interested in minimizing the total energy, and for that pur- 
pose one can set JT = 0, classically. This is analogous to minimizing p* + V(x), 
in which case V(x) is minimized and p = 0. When the field is quantized, it will 
be important not to drop the IT” term, as we shall see, just as it is important not 
to drop the p* term when minimizing the Schrédinger energy p? + V(x). The 
last term in (11.1.24) is the electrostatic energy in (11.1.23), after subtracting 
the infinite self-energy. 

This Hamiltonian description of the electromagnetic field and its interaction 
with charges and currents is manifestly not relativistically invariant. It has to be 
recomputed after each Lorentz transformation. Nevertheless, it seems to be the 
only way to give a Hamiltonian formulation of the fields and their energies. It is 
used, as we said, in actual calculations of the spectra of atoms and molecules. 

There does exist a relativistically invariant formulation using the ‘Lorentz 
gauge’ instead of the Coulomb gauge, in which ¢(x) appears in addition to 
A(x). In the Lorentz gauge there are constraints between ¢ and A, however, 
which makes the theory complicated, especially when an attempt is made to 
quantize it. Nevertheless, the Lorentz gauge is the most convenient one to use 
for the perturbation-theoretic treatment of scattering of particles. The Coulomb 
gauge is convenient for bound state problems, especially the calculation of the 
ground state energy. A good discussion of the relationship of the two gauges 
can be found in Heitler’s book [87, Chapter 1, see §6]. An early article that 
established the importance of the Hamiltonian in the Coulomb gauge is Fermi’s 
[63]. 

In the Coulomb gauge, which we use here, the div A = 0 condition means that 
A is constrained to be transversal (meaning that in Fourier space k - A(k) =0 
for every k). Hence there are only 2 degrees of freedom (the two transverse 
polarization vectors) at each k point. The same applies to I(x). Consequently, 
since both A and 77 are real-valued, we can represent them in terms of a complex 
valued function a(k, i) as 


2 
1 . ei 
A(x) =) / —— @,(k) (e77"**a(k, A) + e277" *a(K, A)) dk (1.1.25) 
ie, wl ( ) 


a 
T(x) = -5 - / Jikl ex(k) (e7*ak, a) — eo 2tiex aE, *) dk. 
A=1 p 


(11.1.26) 
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The polarization vectors €,(k) and e2(k) are orthonormal and orthogonal to k, 
i.e., €,(k) - k = 0. Their choice is not unique, of course. They can not be chosen 
to be continuous functions of k, however (because one cannot ‘comb the hair on 
a sphere’).” In terms of this complex valued function a(k, A), the magnetic field 
B = curl A is expressed as 


2 
201 ‘hex Sat 
B(x) = / —~k A @,(k) (e°""**a(k, 4) — e77"**a(k, A)) dk. 
DI) amt ) 


(11.1.27) 


It is convenient to factor out the terms 1/./|A| and ./|k| in the integrands in 
(11.1.25) and (11.1.26), respectively, since the field energy can then be simply 
written as 


2 
1 
[| T(x, t) + ap BO | dx = Y f[ emtitac. d)P-dk. (1.1.28) 
uA 
R Asli 
The fact that IT and A are canonically conjugate variables implies that the 


Poisson brackets between a(k, 4.) and a(k, i) are given by 


{a(k, A), a(k’, X/)} = b,,5(k — k’) 


{a(k, A), a(k’, AY} = {alk, A), alk’, r’)} = 0. (11.1.29) 


11.1.3. Quantization of the Electromagnetic Field 


We now follow the standard rules of quantization. The canonical variables a(k, i) 
and a(k, 4) become operators a(k, A) and a'(k, A) ona Hilbert space, satisfying 
the canonical commutation relations 


lak, 2), al(k’, A] = 5,,/8(k — k’) 


[a(k, 2), a(k’, A’)] = [al(k, d), al(k’, 4] = 0. (11.1.30) 


Also A and IT become operators on this Hilbert space. They are still given 
by (11.1.25) and (11.1.26), with the functions a(k, A) and a(k, 4) replaced 
by the corresponding operators a(k, A) and ai(k, A), respectively. A natural 


? The lack of continuity can lead to spurious technical problems. One way to restore continuity 
is to make use of an artificial third component e3(k). See [73, 120]. 
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representation is in terms of Fock space [67]. It is given by 
P= precy 
N>=0 


where L?(R?; C2)8s stands for the N-fold symmetric tensor product of 
L?(R?; C), ie., the totally symmetric square integrable functions of N vari- 
ables. The sum runs over all non-negative integers N, which are interpreted as 
photon numbers. For N = 0, L?(R°; C22 =C by definition. The correspond- 
ing vector is called the vacuum, denoted by Q. It is annihilated by all the a(k, A), 
that is, 


a(k,aA)Q=0 forall keR* and Ae {1,2}. 
A vector in F can be thought of as a sequence of functions 
VW = {Wo € C, Wilk, A), Waki, Ar, 2, A2),-.- 
The inner product on F is then 
Cc 
(©, ¥) = ) (bn, Ww). 


N=0 


The annihilation operator a(k, 4.) maps a function of N variables to a function 
of N — 1 variables. Explicitly, 


(a(k, Aww) (Kr, Ay... Ky —t, Aw) = VN (ki, A, ee Kaa, Ani, k, A). 
Their adjoints are called the creation operators, which act as 
(al (k, Ww) (1, Ads... Kv, Ana) 
= JN+18 Wyk, At, -. +. ky, AWS(Kn4t — Say cir 


where S stands for the symmetrization operator, i.e., 
1 
Sf(ki,A1,...,kn, Aw) = NI SS f (Rac), Ana), «++, Kx), Anny): 


the sum being over all permutations of the N variables. It can be readily checked 
that at(k, A) is the adjoint of a(k, A), and that the canonical commutation 
relations (1 1.1.30) are satisfied. 

Strictly speaking, a(k, 4) and a'(k, A) are not operators but operator-valued 
distributions. They can be made into bona fide (although still unbounded) 
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operators by smearing them out with test-functions in the usual way. For- 
mally, this means considering operators of the form f»; f(k)a(k, A)dk instead 
of a(k, 4), with f infinitely differentiable and of compact support. See [150, 
Sect. X.7] for a proper definition. Alternatively, one could confine the electro- 
magnetic field to a large but finite box, in which case the k values would become 
discrete and the Fourier integral becomes a Fourier series. Then a(k, A) is a 
genuine operator. 

In analogy with (11.1.28), the electromagnetic field energy is given by the 
operator 


2 
Hy = Df amkiatar. Aa(k, A)dk. (11.1.31) 
A=1 3 


The precise analogue of the left side of (11.1.28) would actually have 
(1/2)(ata + aa’) in the integrand, which differs from (11.1.31) only by an 
infinite constant, which is of no physical significance here. The choice (1 1.1.31) 
corresponds to an energy scale in which the vacuum has zero field energy, which 
is a natural condition. 

In the absence of matter, the Hamiltonian (1 1.1.31) describes the full energy of 
the quantized electromagnetic field. In particular, it reproduces Planck’s seminal 
discovery [149] that the electromagnetic radiation field is quantized in terms of 
photons, whose energy equals hv = hc|k| = 27 |k| in our units, where h = 
2h = 2m and c = 1. Here, v is the frequency and 1/|k| is the wavelength. 
Moreover, the theory is linear in the sense that the energy of N photons is the 
sum of the individual energies. This follows from (1 1.1.31), since Hy acts ona 
W e F by multiplying wy by 27 pan |k;|. It is noteworthy that the additivity 
of the energy holds even if the photons have the same value of k. Different 
values of k refer to different, independent oscillators, so it is not surprising that 
the energies are additive; the additivity for photons with the same k-value is a 
special property of the spectrum of harmonic oscillators. 

The interaction of the electromagnetic field with matter is described by the 
A field, which formally is given by (1 1.1.25). In the quantum case, this is not a 
well-defined operator, however. This can already be seen by applying A(x) to 
the vacuum (2. Each of the three components of A(x) is a one-photon state, 
with photon wave function @,(k)|k|~ '/2 This is nota square-integrable function, 
however! The integral over k diverges for large k, a phenomenon that is called 
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ultraviolet divergence. Hence A(x) can not be defined as an operator on F. 
Physically, this divergence is a consequence of the fact that we are looking at 
a fixed x, which is what we need in order to describe the interaction of the 
electromagnetic field with a point-particle located at x. If the particle were not a 
point but rather had a charge density described by x(x — Xo), then the relevant 
interaction would be 


| A(x)x(x — Xo)dx = 2 | eth) ( (k) ( er Maka) 


+e *bseal(k, a) dk. (11.1.32) 


As long as X(k)|k|~!/ is a square-integrable function, this expression yields a 
well-defined operator. 

The function ¥(k) in (11.1.32) can be interpreted as an ultraviolet cutoff. It 
satisfies ¥(0) = 1 and |X(k)| < 1 (since x(x) > 0 and f{ x(x)dx = 1). In the 
following, it will be convenient to simply choose x(k) = ©(A — |k|) for some 
A>0. 


11.2 Pauli Operator with Quantized Electromagnetic Field 


For non-relativistic spin 1/2 particles in a quantized electromagnetic field, the 
Hamiltonian resembles the one in Chapter 9, Eq. (9.5.1). It is given by 


N 
H =) [o;-(p, + VaA(x;))| + 0 Vo(X, R) + Hy, (11.2.1) 


i=1 


and acts on the Hilbert space [ ia L?(R?; C?)] ® F, where F denotes the photon 
Fock space described in the previous section. The vector potential A(x) describes 
the interaction between the radiation field and the particles. To simplify the 
notation, we take the ultraviolet cutoff function x(k) to be the characteristic 
function of a ball of radius A. The vector potential is then 


A(x) = » / Fae (¢ em alk, A) + eal (k, 2)) dk. (11.2.2) 


=Nki<a 
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The operators a'(k, A) and a(k, A) are the creation and annihilation operators of a 
photon of momentum k and polarization A, satisfying the canonical commutation 
relations (1 1.1.30). Recall that the field energy Hy is defined in (11.1.31). 

The reader might wonder how H, in (11.1.31) compares with the classical 
field energy Emag(B) = (877) : f |B(x)|?dx. Classically, there is no I(x)? term, 
since we considered the B field to be time-independent. Treating the B field as 
a dynamical variable, the field energy has the additional term 27 f |IT(x)|*dx. 
Naively one might think that the energy is now bigger because of this term, 
and the problem of stability of matter should become simpler. Alas, this is not 
the case. As remarked, I” plus B? really gives ata + aa', which we have 
abbreviated to 2a'a, thereby ignoring an infinite commutator.’ Thus, while 
a‘a continues to be positive, H f iS not necessarily bigger (and can be less) 
than the classical field energy Emag(B). The problem of stability of matter 
in QED becomes more complicated because Hy can be less than Emag(B). 
An additional difficulty comes from the fact that Hy does not commute with 
A(x). 

The quantized field B(x) is now an operator on Fock space, parametrized by 
x € R’, given by 


2 
B(x) = [= peclda A €,(k) (e277 * alk, 4) — 027 al(k, A)) dk 
Df Fakse (6 ) 


=Nikl<a 


(11,2:3) 


This B field is the quantization of the classical field in (11.1.27). As we shall 
see in Lemma | 1.1, it is possible to obtain a lower bound on H; in terms of the 
integral of B(x)* over a bounded set Q C R?. There will be an error term that is 
proportional to the volume of Q and to the fourth power of the ultraviolet cutoff 
A, however. 

The operators B(x) and A(y) all commute with each other, as explicit 
calculation shows. A simpler way to see this is to introduce the operators 
b(k, 4) = a(k, A) +. al(—k, A), which satisfy b'(k, A) = b(—k, 4). Clearly, they 
all commute with each other. If we choose the polarization vectors to satisfy 


3 In a finite volume, where the k values are discrete, the commutator would not be infinite but 
proportional to the volume. 
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€,(k) = e,(—k) for all k, then B(x) can be written as 


B(x) = k A @,(k)e7*"** bck, A)dk, (11.2.4) 


y | 


|k|<A 


and a similar expression holds for A(x). Therefore A(x) and B(y) are linear 
combination of the operators b(k, 4) and hence commute. 

The operators q(k,) = b(k,A)+b(—k,4) and g(k, A) = i(b(k, A) — 
b(—k, i)) are self-adjoint and thus have real spectrum. They all commute with 
each other. (As mentioned above, these are strictly speaking only ‘operator- 
valued distributions’, but we can regard them as genuine operators on a Hilbert 
space if we discretize k so that Fourier integrals become sums.) We claim that 
their joint spectrum is R?. That is, there is a representation on Fock space in which 
all these operators can be regarded as ordinary real numbers without restric- 
tion (except, of course, that g(k, A) = q(—k, A) and g(k, 4) = —q(—k, d)). 
To see this, introduce the conjugate operators p(k, A) = (1/4)[a(k, A) — 
a\(—k, 4) — a(—k, A) + al(k, A)] and p(k, A) = (i/4)[atk, 4) — al(—k, A) + 
a(—k, 4) — a'(k, A)]. The pairs p,q and p,q each satisfy the Heisenberg 
commutation relations [ p(k, 4), q(k’, A’)] = —id,,,/5(k — k’) and they commute 
otherwise. 

If we put all this together we see that there is a representation of Fock 
space (called the Schrédinger representation or ‘Q-space’, see [159] or [150, 
Sect. 10.7]) in which the Fourier coefficients of A(x) and B(x) are independent 
complex numbers. In other words, we can regard the A field and the B field as 
arbitrary functions (subject to the condition that their Fourier transforms vanish 
for |k| > A) with B = curl A. The field energy, however, involves IT, which is 
composed of the conjugate operators, p and p, and which do not commute with 
A and B. That is why the case of quantized fields is different from the classical 
case. If the field energy were (87r)~! f B(x)*dx instead of H,, the quantum case 
would simply reduce to the classical case. 

The following lemma will be important when considering the question of 
stability of matter for the Hamiltonian (11.2.1). As mentioned above, it allows 
for bounding the field energy H; from below in terms of the integral of B(x)’ 
over sets of finite volume. In this way, the problem of stability of matter with 
quantized fields can be reduced to the classical case. 
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Lemma 11.1 (Classical and Quantum Field Energy). For any function g € 
L\(R?) with O < g(x) <1 


Hy = a | |B(x)|? g(x)dx — 227 A+ / g(x)dx. (11.2.5) 
R3 R3 
This lemma was proved in [27, Lemma 3]. For a more general inequality of 
this type, see also [119, Lemma B.1]. 
Proof. Write B(x) = C(x) + C(x)', where C(x) denotes the part of B(x) in 
(11.2.3) containing only the annihilation operators a(k, 4). Then 


B(x)? = [C(x) + C(x)'P 
= 4C(x)! - C(x) + 2[C(x), C(x)'] + [C(x) — C(x)? 
< 4C(x)! - C(x) + 2[C(x), C(x)']. 


The commutator [C, C'] means yc i C'T]. It can be easily evaluated by 
using the canonical commutation relations (11.1.30). It is independent of x, and 
given by 
2 
(xy 
[C(x), C= > | (kn e,ydk 


fed |k| 
~ |k|sA 


= 227) / |k|dk = (27) A‘. 
|k|<A 


Moreover, 


2 
[co -canyax = (22> _ / |kla'(k, A)a(k, A) < 20 Hy. 
R3 A=l KIA 
Since 0 < g(x) < | the rest is obvious. | 


Recall that in the case of classical magnetic fields, stability of matter for the 
Pauli operator holds if and only if both a and Za? are small enough. This was 
shown in Theorems 9.2 and 9.3 in Chapter 9. In the case of quantized fields, a 
similar result holds, as we shall now describe. 

The first proof of stability of the second kind for the Hamiltonian (11.2.1) 
was given by Bugliaro, Frohlich and Graf in [27]. They used Lemma 11.1 in 
order to reduce the problem to the classical case, studied in Chapter 9. We shall 
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follow a similar route in the proof of the following theorem about non-relativistic 
stability. 

Let 6(Z) denote the maximal value of a such that the inequality (8.6.2) is 
satisfied for g = 2. It was shown in Lemma 8.6 that the inequality 


v2 (so + aACe)| = We) we (11.2.6) 
LBZ? ~ 8 aD 


holds for all normalized, fermionic wave functions wy € ita L?(R?; C”) and all 
0 < d < 1. The function W’ is defined in (5.4.1), and D; is half the distance of 
R to the nearest nucleus. Although Lemma 8.6 was formulated with classical 
vector potentials, it holds with quantized vector potentials as well, since these are 
equal to classical fields in the Schrodinger representation, as discussed above. 
That is, (11.2.6) holds for yw € LA” L?(R?; C?)] ® F, with A(x) the operator 
defined in (11.2.2). 

Inequality (11.2.6) will play a crucial role in the proof of the following 
theorem. 


Theorem 11.1 (Stability with the Pauli Operator and Quantized Fields). Let 
Z = max; Z; and assume that 


a? < 4313 (2), (11.2.7) 
872 


where B(Z) is defined just before (11.2.6). Let H be the Hamiltonian (11.2.1) 
and let y € [/\% L2(R3; C?)] @ F with (W, W) = 1. Then we have 


2 ; : 
citi z ow (Fo SE a (4:1) 


402 e B(Z)2 


for any £ > 0. 


For the physical value aw = 1/137, a numerical evaluation shows that our 
stability criterion holds for all Z < 854. (Compare with the case of classical 
magnetic fields, where stability was proved in Theorem 9.2 for all Z < 953.) 


Proof. Without loss of generality we can assume again that Z; = Z for all 
j =1,2,..., M. This follows from the monotonicity of the ground state energy 
in the nuclear coordinates, as discussed in Section 3.2.3. In fact, the result of 
Proposition 3.1 applies equally well to the quantized field case. 
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Let x be a nice function on R,, with x(t) =1 for t< 1, x@) =0 for 
t > 2and0 < x(t) < 1 in-between. For ¢ > 0 let n)(x) = x (min; |x — R;|/£) 
and (x) = /1—7;(x)?. The function 7, is 1 within a distance € from 
any of the nuclei, and 0 a distance 2¢ away from all the nuclei. Moreover, 
\Vinil? +|Vno|* < Ce for some constant C > 0. In fact, with the choice 
x(t) = cos((z/2)(t — 1)) for 1 <t <2 the constant C can be taken to be 
C=277/4 

Using the fact that n? + 5 = 1, it is easy to see that’ 


[o -(p+ J@A(x))]” = mix) [o -(p + VeA(w))] 71(x) 


+ no(x) [o «(p+ JeA(x))] n(x) 
— |Vni(x)]? — |Vno(x)/. (11.2.9) 


For a given we€ ih L?(R?;C?)] @F and i; € {1,2} for l1<j <N, let 
Ve = ee ni (x). Using (11.2.9) we can write 


2 2 
(WW, Hy) = SO Bi-ins HVi~in) 
i=l iy=l 


= (4 00 [Veep P+ 1Vm@)P] wv). 12.10) 


The last term is bounded from below by —NC£~?(y, W). 

We shall now derive a lower bound on (¥j,...iy, H Wi,...iy) for given values of 
ij,1 < j < N.Weemploy the improved electrostatic inequality of Theorem 5.5, 
which states that 


Vc(X, R) > -Lwent FL (11.2.11) 
i=I1 J 
Our choice for A will be the optimal value given in (5.4.3). For this value, 
W*(x) <(Z4+V72Z4+ 1/2)/ min; |x — R,|, and hence W*(x;) is bounded 
from above by (Z + V2Z+4+1/2)¢~! if x; is in the support of 72, ie., if 
|x; — R,| = ¢ forall 1 < k < M. In particular, 


FASTA 


(Ueaaes W* (xj) Wiy---in) = £ 


(Wismiys Wiywiy) (1.2.12) 


4 This is known as the IMS localization formula; see [36]. 
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if i; = 2. We shall use this bound for all the particles j with i; = 2. For the 
particles j with i; = 1, we shall use (11.2.6) instead, which implies that 


vz M 


>, wx) <2 ; XD, aD Y- [pp + VaA(x))). (1.2.13) 


j,ij=l j,ij=l 


1 


By combining (11.2.11)—(11.2.13) and dropping the positive kinetic energy for 
the particles with 1; = 2, we obtain the bound 


Wisin» H Wiy-iy) 2 Wiine His-in Wir-in) 


= ee Wiyriy) (112.14) 


Aes = se ( Oper JaA(x;))} - 


j,ij=l 


aa? + Yaa) + Hy. 


(11.2.15) 


Note that Ay, i 
i;=1, 
Using Lemma 11.1, with g being the characteristic function of the set 


y acts non-trivially only on the variables z; = (x;,0;) with 


={xeR: min |x — Ry| = 2€}, 
we conclude that 


OY" name. 


1 
Hy > = | Borer — 
87 


After integrating out the variables z; with i; = 2, the one- anaes density of 
Wi,...iy for the particles 7 with i; = 1 is supported in the set ©. Hence we can 
apply Corollary 9.1 to conclude that 


~ a2 (4) dieccté 
(Wi, iy» Hj, ....in Wi, iy) > AN GZ _ ae Me (Wi,-iy> Wi,-iy) 
as long as (11.2.7) holds. This bound is valid for all values of i;, 1 < j < N.In 
combination with (11.2.10) and (1 1.2.14), we arrive at (11.2.8) | 


We have thus proved stability for the Pauli Hamiltonian with quantized elec- 
tromagnetic field, for suitable values of w and Za?. It turns out that because 
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of the ultraviolet cutoff A the bounds on a and Za? are actually not necessary 
for stability of the second kind, i.e., stability holds for all values of a > 0 and 
Z > 0. This was shown by Fefferman, Frohlich and Graf [59] and, subsequently, 
by Bugliaro, Fefferman and Graf [26]. The same would be true for classical mag- 
netic fields in the presence of an ultraviolet cutoff. The proof of this fact utilizes 
Lieb-Thirring inequalities for the Pauli operators which involve gradients of 
B(x) and which are more complicated than the ones given in Chapter 4; a partial 
list of references on this topic is in Section 4.4. 


11.3. Dirac Operator with Quantized Electromagnetic Field 


The next step is to consider the system discussed in Chapter 10, where the 

kinetic energy is described by the Dirac operator. The difference now is that the 

electromagnetic field is quantized. We expect the conclusions to be essentially 

the same as before, and they will be, but significant questions of principle arise 

because the field is not just a function on R? but it is an operator on Fock space. 
Formally, the Hamiltonian of the system is given by 


N 
H = )° Daj +aVce(X, R) + Hy (11.3.1) 


i=1 


where D4 ; = Do, + w; - A(x;) is an operator that acts on L?(IR*; C*) @ F. The 
Dirac operator Do is defined in (10.1.1), and we use the standard units where 
m = 1. The Coulomb potential Vc is defined in (2.1.21) in Chapter 2, and the 
field energy Hy is given in (11.1.31) above. We note that the operators D4; do 
not act on different factors of a Hilbert space with tensor product structure (as 
they did in the case of classical fields), but they still all commute. The reason 
for this is that A(x) and A(y) commute for all x and y, as was explained after 
Eq. (11.2.3). Also Do,; clearly commutes with A(x ;) fori # j. Because of this 
commutativity, it is possible to define the joint positive spectral subspace. That 
is to say, all the projections A;* onto the positive spectral subspace of Da 
commute with each other, and hence Te 1 rs which does not depend on the 
order in which the product is taken, is again a projection. 

It thus makes sense to restrict the Hamiltonian to the joint positive spec- 
tral subspace of all the D4.;, given by ik 4 ASI) L?(R?; C*)] ® F). This 
subspace is not trivial and, in fact, infinite dimensional, as shown in [119, 
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Appendices C and D]. The resulting model is the quantized version of the mod- 
ified Brown—Ravenhall model discussed in Chapter 10. The physical Hilbert 
space for this model no longer has a tensor product structure, since there is only 
one photon Hilbert space (the Fock space) which is shared by the N electrons. If 
each electron had its private Fock space, things would be simpler. The electrons 
are intimately connected to the photons and can not be separated from them. 
In other words, in this model there is no such thing, even conceptually, as an 
‘undressed electron’. 

It is also possible to define the quantized analogue of the original Brown— 
Ravenhall model, where the physical Hilbert-space for the electrons is taken to 
be the positive spectral subspace of all the Do,;. In this case the physical Hilbert 
space does have a tensor product structure, but this simplification comes with a 
price; the model is unstable. 

The conclusions concerning the stability of these two models with quantized 
electromagnetic field are essentially the same as in the case of classical fields 
discussed in the previous chapter, except for a modification of some of the 
constants in the stability bounds. 

The following stability criterion for the modified Brown—Ravenhall model is 
proved by Lieb and Loss in [1 19]. It should be compared with the non-quantized 
version in Theorem 10.1. 


Theorem 11.2 (Stability of Modified B—-R Model with Quantized Field). Let 
kK = max{64.5, 7Z} and n = min{4(ka)*, 1}. Assume that (ka)? < n and 


[n — (ka)? /? > (0.060)8z 77a. 


Let H be the Hamiltonian (11.3.1). Then 


18 
(Wy, Hy) > J/1-nN-—AN*4M'4 


TU 2 


(2 6 + (a/2NV2Z + a)" : 
for all w € (T], APIA” L2(R?; C*)] @ F) with (Wy, W) = 1. In particular, 
Z < 42 is allowed for a = 1/137. 


Although the proof of Theorem 11.2 in [119] does not touch upon concepts 
not already seen in the previous chapters, it is rather lengthy and technical. We 
shall not give it here, but refer the interested reader to the original work in [119] 
instead. 
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The fact that there is always instability for the original Brown—Ravenhall 
model, as in Theorem 10.2 in the case of classical fields, was proved in [81] by 
Griesemer and Tix. They were able to reduce the problem to the non-quantized 
case by judicious use of coherent states of the field. These are states in Fock 
space that are eigenstates of the annihilation operators. That is, for any complex 
valued function 7(k, 4), there is a coherent state VW, € F such that a(k, A), = 
n(k, A)W,,. In particular, (W,, A(x) Y,) = Aca(x) for all x € R? and 


1 
(Wy, Ay Uy) = / [2attace? + gn Bale? | dx, 
R3 


where A, By and I, are the classical fields defined in (11.1.25)-(11.1.27) 
with n(k, A) in place of a(k, 4). The problem is thus reduced to the case of 
classical fields. 

Because of the presence of the ultraviolet cutoff, however, the model is actually 
stable of the first kind. One way to see this is to use the operator inequality 


2 — 8A 2 2 3 
A(x)* < aoe +—A‘ forallx eR, (11.3.2) 
au 


which was proved in[119, App. B]. Its proof is similar to the proof of Lemma | 1.1 
using Schwarz’s inequality. Using operator monotonicity of the square root (cf. 
the footnote on page 87), it follows from this that 


a al ISA 2 
;:A DQ = A ))| > —N,/—A —A2 
yo = y |A(x;)| = an ae 


i=l i=l 


and hence 


N 

3.2 

Hp > Va) > a; (AQ) SHA € + =n) : (11.3.3) 
i=l 


The remaining part of the Hamiltonian is Yo Do,;, which is positive by defini- 
tion. Even if one adds the Coulomb potential V-(X, R) the model is stable of the 
first kind for small enough values of a and Za; this follows from Theorem 10.1. 

The model is always unstable of the second kind, however. This is our second 
example of stability of the first but not the second kind. (The first was non- 
relativistic charged bosons with Ey * —N?/?,) In fact, by proceeding as in the 
proof of Theorem 10.2 we see that even with an ultraviolet cutoff one can make 
the energy as negative as -Ca A N? for some constant C > 0. That is, the lower 
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bound (11.3.3) is sharp except for the constant. (The fact that the bound is linear 
in A follows from dimensional analysis. The only inverse length in the problem, 
besides A, is the mass, which can without loss of generality be set equal to zero 
when analyzing stability, since the 6m term can raise or lower the energy at 
most by mN.) We leave the details to the reader. 

The various conclusions are summarized in the following tables borrowed 


from [119]. 


Electrons defined by projection onto the positive subspace of Do, the free 


Dirac operator 


Classical or quantized field 
without cutoff A 
a > 0 but arbitrarily small. 


Classical or quantized field 
with cutoff A 
a > 0 but arbitrarily small. 


Without Coulomb 
potential a Vc 


Instability of 
the first kind 


Stability of the first kind. 
Instability of the second kind 


With Coulomb 
potential a Vc 


Instability of 
the first kind 


Instability of the second kind. 
Stability of the first kind when 
both a and Za are small enough 


Electrons defined by projection onto the positive 
subspace of Da, the Dirac operator with field 


Classical field with or without cutoff A 
or quantized field with cutoff A 


Without Coulomb 
potential waVc 


The Hamiltonian is positive 


With Coulomb 
potential a Vc 


Instability of the first kind when either 
a or Za is too large 


Stability of the second kind when 
both a and Za are small enough 


CHAPTER 12 


The Ionization Problem, and 
the Dependence of the Energy on 
N and M Separately 


12.1 Introduction 


The results in the preceding chapters on stability of the second kind were mostly 
of the form Ey > —C(N + M), where Ep denotes the ground state energy of 
the system, and N and M are the number of electrons and nuclei, respectively. 
It is obvious, however, that if N is very large or very small compared to M 
the excess number of particles, positive or negative as the case may be, will 
float away to infinity. In other words it ought to be possible to reformulate the 
previous results as Ey > —C’ min{N, M} for a suitable C’ that depends only 
on the nuclear-electron charge ratio Z. 

In the relativistic case discussed in Chapter 8, the energy is actually non- 
negative for suitable w and Z, independently of N and M. From this we con- 
clude that also the non-relativistic energy can be bounded below by Ey > —CN 
independent of M, as discussed in Remark 8.6. (For an alternative method, see 
[83, Thm. 3 of Part II].) Also the results in Chapters 9 and 10 yielded bounds 
of this form, since they rely in an essential way on the non-negativity of the 
relativistic energy in Chapter 8. This answers half the problem, namely it gives 
a bound on the energy of the correct form if M is larger than N. 

In this chapter we shall deal with the other half of this problem, namely we 
shall show that for many models the energy can be bounded from below by 
M independently of N. More precisely, we shall show that if N > 2Zio%4 + M, 
where Ziot = ye Z, is the total nuclear charge, one can remove the excess 
electrons without raising the energy. In other words, a system consisting of 
M nuclei having charges Z, cannot bind more than 2Z;., + M electrons; the 
remaining electrons will move off to infinity. Consequently a bound of the 
form Eo(N, M) > —C(N + M) implies that Eg(N, M) > Eo(2Zi4 + M, M) > 
—2C (Zio + M) independently of N. While the bound 2Z,., + M is far from 
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what is expected (namely Zi + const. M, see Section 12.3) it is adequate for 
our purposes. 


12.2. Bound on the Maximum Ionization 


By the word ionization we mean negative ionization, namely N is such that 


Tt = N = Zot > 0. (12.2.1) 


(For fixed nuclear positions and charges there is no bound on the possible positive 
ionization other than —Z < Zio, i.e., when N = 0.) We say that N electrons can 
be bound if Eo(V) < Eo(N — 1), 1.e., it is necessary to use some energy to 
move one electron to infinity. The following upper bound on the number of 
electrons that can be bound by a collection of nuclei applies to several of the 
models considered in this book, but not all of them. 

Specifically, we shall consider the cases in which the kinetic energy of the 
electrons is given by one of the two forms: 


e Non-relativistic 


T= 5 + JaA(x))? (12.2.2) 
¢ Relativistic 
T= Jo + /JaA(x))? + m2 —m (12.2.3) 


for some m > 0. 


As before, A denotes a (classical) magnetic vector potential. Different func- 
tions of (p + JaAy could also be considered as kinetic energies, but we shall 
not do so here. The cases we do not know how to handle are the Pauli and Dirac 
operators studied in Chapters 9 and 10.! 


' In [161] the maximal ionization of atoms described by the Pauli Hamiltonian is studied, and it is 
shown that a B dependent upper bound can be obtained. In contrast, the upper bound obtained 
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The many-body Hamiltonian for N electrons and M nuclei is 


N 
Hy = ))T; + aVc(X, R), (12.2.4) 


i=1 


with Vc given in (2.1.21). It acts on wave functions yw € ia L?(R?; C4) that 
are antisymmetric functions of the space-spin variables. This antisymmetry 
requirement is not important in the following, however, because the results in 
this chapter apply equally well to bosons. 

We shall consider the nuclei to be point-like and fixed.” The bound on the 
number of electrons that can be bound to the nuclei will actually hold for all 
nuclear configurations, not merely the one minimizing the energy. Similarly, 
we shall consider the magnetic vector potential A to be fixed. Hence we have 
omitted the magnetic field energy Enag(B) in (12.2.4), which is not relevant to 
the question addressed here. 

The main result of this chapter is the following. 


Theorem 12.1 (Bound on the Maximum Ionization). Let Eo(N) denote 
the ground state energy of Hy in (12.2.4). If Eg(N) < Eo(N — 1) then N < 
2Ztot + M. 


Part of the celebrated HVZ Theorem, going back to the work of Hunziker, 
van Winter and Zhislin [95, 183, 190], states that under the assumption Eo(N) < 
Eo(N — 1) there really is an eigenvalue at the bottom of the spectrum of Hy. 
That is, there is a y € L? such that Hy = Eo(N)W. Since it will be useful 
later, we state this as a separate lemma.° 


Lemma 12.1 (Part of the HVZ Theorem). Assume that E)(N) < Eo(N — 1). 
Then there exists a ground state eigenfunction of Hy in (12.2.4). 


The lemma is intuitively very obvious, but a rigorous proof is lengthy and 
complicated. We refer to [150] or [82].* Lemma 12.1 is useful in the proof 


N 


For results on the model with dynamic nuclei and/or smeared out nuclei, see [1 14]. 


w 


The full HVZ Theorem states that the essential spectrum of Hy starts at the ground state energy 
of Hy-1. See [178] or [150, Vol. 4]. In particular, if Eo(N) < Eo(N — 1), then Eo(N) is below 
the essential spectrum and, therefore, is an eigenvalue. 


In the relativistic case, Lemma 12.1 was proved in [106]. See also [140, 143] for the case of the 
Brown-Ravenhall model. 
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of Theorem 12.1, but is not absolutely necessary. At the end of the proof of 
Theorem 12.1, we shall show how to avoid its use. We do this in order to give a 
self-contained proof of Theorem 12.1, since we do not prove Lemma 12.1 here. 

Theorem 12.1 actually holds under the weaker assumption that there exists 
an eigenfunction corresponding to the lowest eigenvalue Eo(N). That is, 
N < 2Zto + M holds in case a ground state eigenfunction exists, even in case 
Eo(N) = Eo(N — 1). The theorem states that the maximal number of electrons 
that can be bound by a collection of nuclei is strictly smaller that 2Z,., + M. In 
particular H~~ (a system of one proton and three electrons) does not exist (at 
least in the approximation in which the nucleus is static). On the other hand, it 
is a theorem of Zhislin [190] that if N < Zi, + 1 then the system is bound. For 
completeness, we shall give a proof of this statement later in Theorem 12.2. 

If we assume that B(x) decays to zero at infinity, then clearly Eo(N) < 
Eo(N — 1) for all N since we can always place the additional electron arbitrarily 
far from the nuclei with infinitesimal energy cost. The strictness of the inequality 
is the crucial criterion for whether the system is bound or not. 

For simplicity, we shall prove Theorem 12.1 only in the case that all the nuclear 
charges are equal. We refer to [114] for the general case. For the application 
of Theorem 12.1 to the question of bounding the ground state energy Eo from 
below by —C min{N, M}, as discussed in the introduction to this chapter, we 
can safely consider only the case in which all the nuclear charges Z; are equal 
to some common value Z. This was pointed out in Subsection 3.2.3, where it 
was shown that for any given configuration of nuclei positions and Z; € [0, Z], 
the lowest ground state energy is obtained when, for each j, Z; is either Z or 
Zero. 


Proof of Theorem 12.1. First, let us prove the theorem under the assumption that 
there is an eigenfunction yw of Hy corresponding to the eigenvalue Eo(N). This 
assumption is the content of Lemma 12.1 above but, since we do not include its 
proof here, we shall explain how to avoid the use of this assumption at the end 
of this proof. 

The Schrodinger equation for w can be written as 


0 = (Ay — Ed(N))w (12.2.5) 


Zia = a 

k 

= ( Hy1+ Tw — >> ———— + )>§ —— _ - (N Ww. 
( —_ 7 |X Rx| i=1 x; Xn| se ) 


k=1 
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Pick a strictly positive function ¢@ on R* and multiply (12.2.5) by 
W(z1,.--,Zn)/@(xy). Recall that z = (x, 0) denotes space-spin variables, and 
that w is antisymmetric in these variables. Since Hy_, > Eo(N — 1) > Eo(N), 
the operator Hy_; — Eo(NV) is non-negative on antisymmetric functions of the 
variables Z;,..., Zy—1. Since it does not affect the zy variable, this implies that 


v(z)—— ne 2 ((Hy-1 — Ey) W) (z)dz = 0. (12.2.6) 


Moreover, if ¢ is superharmonic (that is, Ad < 0), the term with Ty is 
positive in view of the following lemma, which was inspired by unpublished 
work of Benguria (see [1 13, p. 632]). 


Lemma 12.2 (Positivity Property of T). Let T be either one of the operators 
(12.2.2) or (12.2.3). If @ is a non-negative superharmonic function, then 


T+T—>0 
d o- 


in the sense of quadratic forms. Equivalently, for any w € L*(R°), 
Re (w/o, T wy) = 0. 


Technicalities concerning the domain of definition of the operators shall be 
blithely ignored here. We refer the interested to reader to [114]. 

We postpone the proof of this lemma, and conclude the proof of Theorem 12.1 
first. After integration of all the z; we obtain from (12.2.5) and (12.2.6), with 
the aid of Lemma 12.2 (and the fact that the real part of zero is zero) that 


fworz re, eee ee, 
~ b(tw) \ St ew — Rel Se xi — xl PO 


Since ||? is symmetric in all the variables, this can be written as 


2 
fv (-x PON Dea 


ls 1 


ya 1 
—— + —— } ———— ]d 0. 12.2.7 
3 Cenur s) ae ee 


i= 


rf 
2 


First, consider the case of an atom, i.e, M = 1 and R; = 0. We choose 
d(x) = 1/|x|. By the triangle inequality, 1/@(x;) + 1/é@(«y) = |x;| + |xn| = 
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|x; — xy|, and this inequality is strict except on a subset of R? x R? of measure 
zero. Hence (12.2.7) yields 


1 
Ae tN 10, 


or N <2Z + 1, as claimed. 

Next, consider the molecular case with all nuclear charges taken to be equal, 
ie., Z, = Z for all k = 1,..., M. We choose ¢ to be (x) = )>, |x — R,|7. 
Then 


1, 1 _ oi) + en) 
P(xi) (xn) P(X; )P(X v) 
M 
=) ge(xi)ge(xn) (li — Rel + [Xv — Ril), 
k=1 


where we let g(x) = 1/(¢@(x)|x — Rx|). Again, by the triangle inequality, 
|x; — R,| +|xyv — Ry| = |x; — xy]. Using the symmetry of |y|?, (12.2.7) then 
becomes 


M 
[iv@r (LX euexnentey | de <20z. 
k=1 i¢j 
The strictness of the inequality follows again from the fact that the above triangle 
inequality is strict except on a set of measure zero. 

For given k, we write igi g(x: )ge(x )) =| 0; sexi)? — Do; ge(xi)?. To 
estimate the latter term from below, we can use the fact that 5°, g,(x;) = 1 
to conclude that }*, 5°; gx(x;)? < N. For the first term, we use Schwarz’s 
inequality 


2 
N N? 
2 s1(e) 22 2 840) arr 


Summing over k and using again that }°, g,(x;) = 1, we conclude that 


> ax(xi) 


i 


M 


Ds 


k=1 


‘ 2 
Sk 
M 


In combination, we have thus shown that 


N2 
—-—WN <2ZN, 
M 


or N < M(2Z + 1), which is the desired bound. 
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The case of unequal nuclear charges is slightly more complicated, and we 
shall not give the details here, but refer the reader to [114]. It turns out that the 
natural choice (x) = )°, Z,/|x — Rx| does not work, however. One rather has 
to pick suitable positive numbers jz, and choose $(x) = >>, Me/|x — Rx|. For 
a suitable choice of these numbers, one can then show that N <2 )°, Z, + M. 

As promised, we shall now relax the assumption that there exists an eigenfunc- 
tion of Hy with eigenvalue Eo(NV). By hypothesis, Eg(N) < Eo(N — 1) — 3e 
for some ¢ > 0. By definition of the ground state energy, there as a yy such 
that (WwW, Hyw) < Eo(N — 1) — 2e. Since infinitely differentiable functions of 
compact support are dense in H! and H'/* [118, Thms. 7.6, 7.14 & 7.22], we can 
find a wave function i, supported in the set |x;| < R, for suitable R, > 0 such 
that (W,, Hyw-) < Eo(N — 1) —¢. Hence the Hamiltonian Hy restricted to 
such functions (that is, functions supported in a ball of radius R, with Dirichlet 
boundary conditions) has a ground state energy strictly less than Eo(N — 1). 
We can thus repeat the above proof with y being the ground state wave function 
in a box (which always exists) and arrive at the same conclusion. | 


It remains to prove Lemma 12.2, which we shall do next. The proof is taken 
from [114]; for an alternative proof, see [96]. 


Proof of Lemma 12.2. Consider first the non-relativistic case (12.2.2). For w € 
L?(R3) and f(x) = W(x)/¢(x), we can write 


1 _ 
(W/d. TW 5 / (iV Fle) + VaA(x) Fo) 
R 
x (—iP(x)V f(x) — if (X)V G(X) + VaA(x) P(x) f(x) dx. 
(12.2.8) 


With the aid of partial integration one checks that this equals 


1 
; / ($(x)| — iV f(x) + VaAx) fw)? 


R3 


—| f(x) P(Age) + iVaAX)VO(X))) dx. 


The last term drops out when the real part is taken, since both A and ¢ are real. 
The other terms are positive by our assumptions that (x) > 0 and A¢(x) < 0. 
This proves that Re(wW/¢, T yw) => 0 in the non-relativistic case. 
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To prove the lemma in the relativistic case, we start with the integral 
representation 


CO 
B 1 
B pie-1= = f t= 1)? —_—__ai. 
ae a ( ) t(t + B) 
1 
The lemma thus follows if we can show that 
B 
R —o >0 12.2.9 
e(v/o ‘a: W= ( ) 


for all t > 1, and with B = (p + /a@A)’ being twice the non-relativistic kinetic 
energy. To see (12.2.9), let g = (t+ B)~'y. Then 


B 1 
(vie, ae v) = t(g/¢, Bg)+ G r Bs) ; 


The last term is obviously positive, and the positivity of the real part of the first 
term was shown before. 

For simplicity of presentation we have ignored issues concerning domains of 
operators here. The details are properly worked out in [114, Appendix A]. 


12.3. How Many Electrons Can an Atom or Molecule Bind? 


This question, although not directly related to the stability of matter, is a fasci- 
nating one, and we cannot resist the temptation to mention it here because, after 
many attempts, it is still open! The inequality Z < Zi. + M proved above is far 
from optimal. 

Numerical estimates and experimental observations of real atoms and 
molecules in nature leads one to believe that the maximum ionization, Z := 
N — Zio should be at most cM with c about 1, or possibly 2, irrespective of 
whether or not the nuclei are in their optimum locations. Atoms with Z = 2 are 
rare, if they exist at all.” No one has been able to prove anything resembling 
a theorem of this kind, except in the context of approximate theories, such as 
Hartree-Fock theory [171] or Thomas—Fermi type theories [8, 113]. 

This question is closely related to several others: Why are the radii of 
atoms (defined as the radius R such that J x|>R o(x)dx = 1/2, with o the 


> We are discussing atoms and molecules in vacuum, not in water, where c can be significantly 
larger, although still bounded. 
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one-particle density of the ground state) more or less independent of Z? Why 
is the ionization potential (given by Eo(N = Z) — Eo(N = Z +1) = 0) more 
or less independent of Z? Why is the largest ionization potential among ail 
atoms in the periodic table less than the smallest electronegativity (given by 
EQ(N = Z — 1) — Eo(N = Z) > 0) among all atoms? If this were not the case, 
neutral atoms with high ionization potential would be bad neighbors and would 
go around stealing electrons from innocent neutral neighbors. 

The conjecture concerning the ionization potential and the electronegativity 
implies that N ++ Eo(N) is convex at N = Z, i.e., 2Ep(N = Z) < EQ(N = 
Z—1)+ EQ(N = Z +1). It has been further conjectured that N > Eo(N) is 
convex for all NV. This, in turn, implies the even more ‘obvious’, but unproved 
fact that if there is an eigenfunction for Eo(N) then there is one for Eg(N — 1). 
That is to say, if a nucleus can bind WN electrons can it bind N — 1? All of these 
interesting questions are open ones, and we shall not discuss them further. 

It is known that a theorem stating that Z ~ M would have to take explicit 
account of the fact that electrons are fermions. Otherwise, if they were bosons, 
the ionization is as large as J ~ 0.21 Z for an atom with large Z. It was shown 
by Benguria and Lieb [14], for an atom, that JZ > cZ as Z — oo with c being 
determined by the solution to the Hartree equation, which is a certain non-linear 
differential equation. Baumgartner [9] showed numerically that this equation 
yields c = 0.21 and Solovej [170] found an upper bound which showed that 
T = 0.21Z is the correct asymptotic formula. This bosonic result shows that con- 
siderations of electrostatics alone cannot solve the problem. The Pauli exclusion 
principle is essential! 

On the other hand, the situation for fermions is not entirely hopeless. Prior 
to the proof of Theorem 12.1 Ruskai [154] had proved that Z < cZ°/> and 
Sigal [163] had proved that Z < 18Z for atoms. Later, Lieb, Sigal, Simon 
and Thirring [128] showed that Z/Z — 0 as Z — oo. Despite subsequent 
improved quantitative estimates by Seco, Sigal and Solovej [158] and by 
Fefferman and Seco [61] the problem remains basically unresolved. For realistic 
atoms the 2Z + 1 bound of Theorem 12.1 is still the best so far. There is a long 
way to go. 

Another positive note, for both fermions and “bosonic electrons’, is a theorem 
proved by Zhislin [190] which states that the number of electrons that can be 
bound is at least as big as the total nuclear charge. It was originally proved in 
the non-relativistic case without magnetic field, but it can be easily generalized 
in the following way. 
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Theorem 12.2 (Zhislin’s Binding Condition). Assume Eqag(B) < 00. If N < 
Ziot + 1 then the ground state energy Eo(N) of the Hamiltonian given in (12.2.4) 
Satisfies Ey(N) < Eo(N — 1). This holds both for fermions and for bosons. It 
also holds for all fixed locations of the nuclei, not only for the energy minimizing 
configurations. 


It can be shown that the conclusion of this theorem also holds for mov- 
able nuclei. By Lemma 12.1 the ground state of the N particle problem is an 
eigenfunction when N < Zi + 1. Thus, a physical atom (with Z;., an integer) 
can bind at least Z,., electrons. A slight modification of the proof shows that 
the N electron atom has infinitely many eigenfunctions (i.e., bound states) if 
N < Zo +1. 


Proof in the non-relativistic case. We shall give the proof in the case of 
fermions; the case of bosons is completely analogous. Our proof will make 
use of part of the HVZ Theorem, as formulated in Lemma 12.1. It is not as easy 
as in the proof of Theorem 12.1 to give a proof that does not use this lemma. 

We shall prove that if N < Z, + 1 and if there is a ground state for N — 1 
particles, then Eg(N) < Eo(N — 1). For N = 2, we merely have to show that 
Eo(1) < Eo(O) = wU(R) in order to ensure the existence of a ground state for the 
one-particle problem. (This follows from Lemma 12.1 above. In the one-particle 
case, the proof is actually much simpler; see [118, Thm. 11.5].) This inequality 
follows easily from the fact that one nucleus with an arbitrarily small positive 
charge can bind an electron. The existence of a ground state for successively 
larger N then follows by induction, using Lemma 12.1 at each step. 

Hence we may assume that there exists a ground state of Hy_,, with 
energy Eo(N — 1), which we denote by yw. We assume that y is normal- 
ized, i.e., || ||2 = 1. We first show that for any R > 0 there exists a function 
WRE | is L?(R3; C2) supported in the set |x;| < Rforalli=1,..., N—1, 
with energy (Ww, Hy_1 W) < Eo(N — 1) + c/R? for some constant c > 0. For 
this purpose, we pick a smooth function x on R*— that is supported in the 
set |x;| < R forall 1 <i < N —1. Itis straightforward to check that® 


x(X)?(p; + VaA(x;))” + (p; + VaA(x:))° x(X)? 
= 2x(X)(p; + Ja A(x;))?x(X) — 2/Vix!? (12.3.1) 


© This is the same as the IMS localization method of (11.2.9). 
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fori = 1,...,N—1.Hence 


1 1 
EQN — DW, x?) = 5 x? Hy) + 5 Hy-1x°) 


N-1 
= (x, Hy-ixw)— SOW IVixP Wy). (12.3.2) 
i=l 
We can pick x such that |V;x |? < c/(2R*). Moreover, (Ww, x7) > 1/2 for large 
enough R. This proves that Wr = x ¥/||x || has the desired property. 
Let dr € L?(R?; C2) be radial, normalized, and supported in the shell 
R < |x| <2R. We claim that we can choose @r in such a way that its 
kinetic energy satisfies limr_..o(@r, T hr)R = 0. A possible choice is éa(Z) = 
8o,1(27 R)~1/? sin(z|x|/R)/|x| for R < |x| < 2R, and zero otherwise. To bound 
the kinetic energy, we can simply use (p + aA)” < 2(p? +aA’). The p? 
term yields c/R?. For the A” term, we can use Hélder’s inequality to estimate 
(hr, A’or) < age ee |A|®°)!/3. If A € L°(R) the latter integral goes 
to zero as R — oo. It is no restriction to assume that A € L°(R*) since this 
property holds in the Coulomb gauge, as shown in Lemma 10.1, and we are free 
to choose this gauge. This proves the claim. 
Let Y denote the N particle wave function obtained by antisymmetrizing the 
function Wa(Z1,..-,ZN-1)Pr(ZNn), namely, 


1 


T — 
(Z) UN 


vata +> Zn—-1)OrR(Zy) 


N-1 
= S WR(Z1, ++ +5 Zi-15 ZNs Zigly +++ cx -a)a(e)} (12.3.3) 
i=l] 


Using the locality’ of Hy and the fact that ¢z and Wp have disjoint support, we 
observe that 


lpr(z)I 


dz 
|x| 


(Y, Hy Y) = (Wr, Hy-1 Wr) + (br, T br) — Zs f 
R3 


2 
+| Ipr(z)| On) gray, 


Ix — y| 
R3xR3 


7 For the notion of locality, see the footnote on page 181. 
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Here, Oy, denotes the one-particle density of yp. We have assumed that R is large 
enough such R > | R;| forall the nuclear positions Rx. Since @p is radial and sup- 
ported in |x| > R, this implies that f |@p(z)|?/|x — Ryldz = f |Or(z)|"/|x|dz 
by Newton’s Theorem. Similarly, the last term equals (NV — 1) f lor(z)|*/|x |dz. 
Hence 


1 — Zrot 


N— 
(Y, Hy Y) < Eg(N — 1) + a + (es T bn) + 


Recall that lime. R(dr, T dr) = 0. If N < Zot + 1, we therefore see that 
Eo(N) < (Y, Hy Y) < Eo(N — 1) for large R. i 


Proof in the relativistic case. The strategy is the same as in the non-relativistic 
case. We shall briefly explain the main differences but leave the unilluminating 
details to the reader. 

The localization formula (12.3.2) is more complicated in the relativistic case 
due to the non-locality of the kinetic energy. The localization error can still be 
bounded by c/R? independently of A, as long as R > 1/m. (The heuristics 
behind this fact is that for momenta |p| < m, the relativistic kinetic energy 
approximately equals the non-relativistic one.) This can be shown using the 
same strategy as in [72, Lemma B.1], where the case m = 0 was considered (in 
which case the localization error is ~ 1/R). 

The kinetic energy of the one-particle function dr can be bounded in the same 
way as in the non-relativistic case, using the inequality /s + m? —m < s/(2m) 
for s > 0. 

As will be pointed out in the proof of Theorem 14.2 in Chapter 14, page 265, 
locality of the one-body terms in the Hamiltonian is not necessary for the absence 
of cross-terms when computing the expectation value of the Hamiltonian in the 
antisymmetrized N-particle wave function. It is only needed for the two-body 
terms. The rest of the proof goes through without essential change. | 


CHAPTER 13 


Gravitational Stability of White Dwarfs 
and Neutron Stars 


13.1 Introduction and Astrophysical Background 


Up to now we have been concerned with demonstrating the “Triumph of Quan- 
tum Mechanics’, which is that the electrostatic forces between the electrically 
charged particles that make up ordinary matter do not cause collapse (provided 
the maximum nuclear charge Z and the fine-structure constant a are not too 
large). The total energy is not only finite but it is proportional to the particle 
number. Electric and magnetic forces conspire to cancel out to a comfortable 
extent, but lurking in the background is a very, very much weaker force — gravity. 
This force is additive, however, and there can be no cancellation as there is for 
electric forces, but because it is so weak it becomes dominant only when WN is 
very large — of the order of the number of particles in a star, which is about 10°”. 

It was already realized shortly after the publication of Schrodinger’s equation 
that a star would collapse under the influence of gravity if the kinetic energy 
of the particles is treated relativistically and if the number of constituent par- 
ticles exceeds a certain critical value; this critical value depends on Planck’s 
constant, h. 

There are two kinds of stars to consider. One is a star made of electrically 
neutral particles called neutrons, and which is itself the residue of a collapsed 
star. Its mass is typically of the order of 1—2 solar masses, but gravity squeezes 
it to a radius of about 10-20 km. In contrast, the radius of our sun is roughly 
one million kilometers. 

The second kind is an ordinary star, but which has burned out, i.e., its nuclear 
processes have finished. What is left is essentially a system of electrons and 
nuclei close to its ground state, as noted by Fowler [69]. The temperature of 
such stars can still be high enough for them to twinkle brightly, which is why we 
can see them optically and which is why they are called ‘white’. But although 
their masses are comparable to the mass of the sun they are tiny (about the 
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size of Earth, but still much bigger than a neutron star) which is why they 
are called ‘dwarfs’. Anderson [3] and Stoner [175] first noted that there is a 
maximum possible mass of such a star now called the “Chandrasekhar mass’ 
after Chandrasekhar who put the calculation on a firm footing [30]. Beyond this 
mass the star collapses under its own gravity. Indeed, a white dwarf can slowly 
accrete matter from a companion star and, when its total mass exceeds about 1.4 
solar masses, it collapses as a type Ia supernova. 

Before studying the collapse, let us discuss, heuristically, the energy balance 
for non-relativistic gravitating matter. If we have N fermions of mass m with 
non-relativistic kinetic energy confined to a ball of radius R, the kinetic energy 
will be approximately N°/?/(R*m), as we saw in Chapter 4. The gravitational 
energy would be roughly —« N/R, where 


kK = Gm? 


is the effective interaction constant and G is Newton’s gravitational constant 
(3.2.10). The total energy is minimized when R ~ N~'/3/(mx). Owing to 
the negative exponent —1/3, ‘a large star is smaller than a small star’. For 
us the important point is that the energy is always finite and there is no collapse, 
in the sense that the system is stable of the first kind. The system is not stable of 
the second kind, since the energy grows like N”/? for large N [105]. For neutron 
stars, we can set m equal to the mass of neutrons which is much larger than the 
mass of electrons. 

The reason that white dwarfs are much larger than neutron stars is that there 
are two masses to consider in the former case. There is the mass ju of the nuclei 
(which is similar to the mass m, of a neutron) so that « = Gy. The other is 
the electron mass m, so that the dominant kinetic energy in the non-relativistic 
regime is N>/? /(mR7). This is much larger than the kinetic energy of N neutrons, 
N°/3 /(m,R7). The balance is now when R ~ N~!/3/(Gm?mz). In Section 13.2 
we address the one-mass case. The two-mass case is discussed in Section 13.3.3. 

Now let us discuss the effect of special relativity, which becomes important 
for such dense systems. The kinetic energy p*/(2m) of the particles has to be 
replaced by ,/ p? + m? — m. For large N, the kinetic energy of a collection of 
fermions confined to a ball of radius R will now only be N*/3/R, independent 
of m > 0. The potential energy estimate given above remains the same and, 
therefore, the system will collapse if N is bigger than roughly «~>/?. This critical 
particle number depends on m only through « and not through the kinetic energy 
/ p? +m? — m. The value of m in the kinetic energy is important, however, for 
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calculating various properties of stars with mass below the critical one, such as 
the radius. 

The question we address here is whether « is really the correct critical 
particle number for collapse. The collapse could conceivably occur with a smaller 
N, in principle, because of the short-distance singularity of the interparticle 
gravitational potential —«/|x; — x ;|. In fact, it does not do so. The heuristic 
discussion given above turns out to give the correct answer. 


—3/2 


13.2 Stability and Instability Bounds 


In this section, we consider the simplest model of gravitating particles, in which 
there is only one species of particles of mass m, which are either bosons or 
fermions. As discussed in the introduction, it will be important that the kinetic 
energy of the particles is relativistic. Since | p| — m < ,/p? +m? —m < |p|, the 
stability analysis, i.e., the question of the number of particles needed for collapse, 
does not depend on the factor m in the kinetic energy (but m is important in the 
potential energy), and hence we will simply use |p| as the kinetic energy of a 
particle of momentum p in the following. The basic Hamiltonian is then 


N 
1 
i=l : . 


1<i<j<N 


with k = Gm? > 0. By scaling, the ground state energy Eo of H is either 0 or 
—OO. 


Theorem 13.1 (Stability of Gravitating Matter). The ground state energy of 
the Hamiltonian (13.2.1) for N fermions with q spin states is, for small x, 


—oo if N > (9/2)37/4m «—3/2g-/? + o(e-7/?), 


E ‘i if N <0.594«73/2g-1/2 + o(¢-3/2) 
0 — 


For bosons, 


Hiee 0 ifN <14+4«7!/n 
9 )-0o if N > 141287! /(1520). 
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This theorem goes back to Lieb and Thirring [136], and the proof we give will 
follow a similar strategy, with some simplification using results from Chapter 8. 

The theorem shows that the critical particle number scales as «~*/* for small 
« in the fermionic case, and as x! in the bosonic case. It is actually possible 
to determine the precise asymptotic behavior of the critical particle number as 
«x —> 0. This will be explained in the next section. 


Proof. Bosons. Upper Bound. For bosons, we simply take a product trial func- 


tion of the form inne , P(x;) for some ¢ € H '/2(R3). For such a wave function, 
the energy equals 


NIN 1 2Ib(y)|2 
Givin 5 ff — dxdy. (13.2.2) 


R3xR? 
The energy is thus negative if 


ll 


2(, |p|) Ip)" 1o(y)? 

k > ————— ———————dxdy 
N-1 |x —y| 
R?xR? 
Once the energy is negative, it can be driven to —oo by scaling, as mentioned 
above. If we choose #(x) = a7 !/? exp(—|x|) as in the proof of Theorem 8.2, we 
find (, |p|) = 8/Gz) and f |P(x)|7|G(y)/?|x — y|"'dxdy = 5/8. The sys- 
tem is thus unstable if « > 128/(15z(N — 1)). 
Bosons. Lower Bound. We write the Hamiltonian (13.2.1) as 


1 K 
H= ——| p,| — ————__ } . 13.2.3 
De (sai imei) (13.2.3) 
1<if#j<N 


For given xj, |p;| = (2/7)|x; — 22 ier as shown in Chapter 8, Lemma 8.2. 
Hence H > Oif xk < (4/m)/(N — 1). 
Fermions. Upper Bound. As a trial function, we take the Slater determinant 


of the N lowest eigenfunctions of the Laplacian in a cube of side length L, 
with Dirichlet boundary conditions. The corresponding eigenvalues are given by 
m°n*L~* withn € N°. Since (¢, |p|) < (¢, p? ¢)' for any ¢ with (¢, ¢) = 1, 
the kinetic energy is bounded from above by 


go le 


|n|<K 
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where K is the smallest number such that the number of points n € N° with 
|n| < K is bigger or equal to N/q. For large N, K © (6N/zq)'/, and hence 


1/3 74/3 
612 N 
q s |n| L~ (= ) “i. : 


jn|<K q 


(Compare with Section 4.2.) Since all the particles are inside a cubic box of side 
length L, |x; —xj| < V/3L for all i # j. Hence 
1 N(N — 1) 


[eee al” 2/3 L 


1<i<j<N 


The total energy is thus negative under the assumption on WN stated in the 
theorem. 

Fermions. Lower Bound. If yy is either symmetric or antisymmetric then, for 
1<L<NandM=N-L, 


N = 
(Yy, HW) = Fale Ay), (13.2.4) 
where 
L Lo oM 1 1 
H= |—~A on —___., 13.2.5 
2 IPil ea ps |x; —x;| : 
i=l i=1 j=1 / 1<j<k<M 4 


and where A and @ have to satisfy the constraint 
ALM — 5 MM he 5 LN =), (13.2.6) 


Equation (13.2.4) follows simply by counting the number of kinetic energy 
and potential energy terms, given that these terms do not depend on which 
particles are involved because of the symmetry assumption on the wave 
function y. 

For simplicity, let us introduce the notation R; = xj+, for j= 1,..., M. 
There is no kinetic energy for the R particles in (13.2.5), hence we can regard 
them as fixed. We then derive a lower bound on H acting on totally antisymmetric 
functions of L variables only. 

For fixed R,,..., Ry, we introduce again the Voronoi cells I’;, as in 
Section 5.2. Similarly, let ® : k= (0, co] denote the function which takes 
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the value 


With D(x) = min; |x — Rj| as before, we can write 


7 L L 1 1 
f= ipl-20 (en) +0) +e > RoR 
i=1 i=l 


1<k<l<M 


The last term here is the same as in (5.2.5) for Z = 1. 
Pick some 0 < € < 1 and split the kinetic energy as | p| = e|p| + (1 — €)| pl. 
On the space of antisymmetric functions, the operator 


L 
Y> (= pil — A&C) 


i=1 


is bounded from below by gq times the sum of the negative eigenvalues 
of (1 — €)|p| — A®(x). From the relativistic LT inequality in Theorem 4.2, 
together with the bound on the optimal constant in Eq. (4.1.23), it follows 
that 


v4 
((1 ~ e)1pil — 2@(2)) = -0.02577 = ; |@(x)|4dx. 
R3 


L 
i= 


1 


To bound the latter integral, we first note that for x ¢IT; and Dj; = 
mins; |Rj = R;|/2 


dQ 
Pie) = aa Ix — Ry — DQ| 


kK, kAj g2 


where & is a vector of length 1 and d& denotes the normalized surface measure 
of the unit sphere S?. (This is just Newton’s theorem.) Hence, by Schwarz’s 
inequality, and for x <€ I;, 


M 

dQ dQ 

@(x)*? <(M—1 <M : 

Ores S| a =) aoe 
kK, KAJ go k=1 go 
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The last term is independent of j and is an upper bound valid in all of R*. The 
integral of |®(x)|* is thus bounded by 


/ |®(x)|*dx < Mwy j dx / sl / ae 
~ |x — Ry — Di Q\? J |x — R; — DQ’? 
R3 kl 3 g2 g2 


=: ie = II ood (13.2.7) 
|R, — R; -— D,Q + D,Q’|’ ~ 


The last equality uses Eq. (5.1.7). For k 4 1, the last integral is just |Rx — R)|~! 
by Newton’s Theorem (since |R, — R;| > D, + D;). Similarly, for k = / it is 
Dee, which can be trivially bounded by 2 a jek Rj — R,|~'. Hence 


1 

|D(x)|\4dx < 677>M? — 
| P25 [Ri ~ Ri| 
R3 = => 


We are left with the task of finding a lower bound to 


: r 
x Ga = sen): (13.2.8) 


Assume that ¢ > 24/2. It was proved in Chapter 8, Lemma 8.5, that (13.2.8) is 
bounded from below by 


M 
—1.514———____ 
(€ — 4 Ly 
Again noting that Ds 22 x ek IRj — R,|~', this proves that 


es [. _ 40.0257 3442 4 9.0564 3 ke 
~ (i=ey (e-1d/2) | ey [Re — Ril 


(13.2.9) 


We shall choose M large and 4 and ¢ small. In this case, the third term 
in square parentheses in (13.2.9) is negligible compared to the second term. 
We choose a = oAtM? with o = 0.0257 x 62°q and AM = (N/20)!/3, with 
A<landl1<M <«N. To leading order, the equation (13.2.6) then becomes 


= 3 96)-13 N27, 
2 
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In other words, we have shown that H > Oif N < (20)~!/2(3/2«)*/? plus terms 
of lower order for small x. | 


13.3. A More Complete Picture 
13.3.1 Relativistic Gravitating Fermions 


As stated in the Introduction, a more accurate model uses the Hamiltonian 


N 
/ 1 


l<i<j<n 


with m > 0. As pointed out above, the critical value of N for stability does not 
depend on m, since |p| — m < ,/|p;|* +m? — m < |p|. Other properties for 
subcritical N, like the size of the star, for instance, will depend on m. In the limit 
of small « and large N one would expect that to leading order the energy and 
particle density is exactly given by the semiclassical approximation, by which 
we mean the following. 

First, the gravitational energy is approximated by the classical energy of a 
particle distribution described by g(x), given by 


= /| A) srady = ghee py), 
2 Ix — y| 


R3xR3 


where the same notation as in (5.1.3) is used. The lowest kinetic energy for a 
given density function 9 in the semiclassical approximation is 


i: J(Q(x))dx 
R 


where j is defined as 


j= : (vie? + m2 — m) dk. 
(27) 
\k| <(6t/q)3 


(Compare with the discussion of the semiclassical approximation in 
Section 4.1.1.) This integral can be evaluated explicitly. It is 


j 2 sg 
io=— ron +m — tin (EVE Em wi ) Jom 


16772 
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with n = (67t/q)'/3. For large t, j(t) © (3/4)(627/q)'/314/3, whereas for small 
t, j(t) = 3/10m)(627/q)°7 0°, which is exactly what we would expect in these 
limits. 

Altogether, the semiclassical energy functional is thus defined to be 


gelassical 9) — / j(o(x))dx — « D(@, Q). (13.3.2) 
R3 


Its infimum over all non-negative densities @ with te o(x)dx = N is the semi- 
classical energy E('*sl(, «, m). It has the scaling property 


yi aaaaaas Ce K, m) = NY Eee, KN, mN7"/3), 


From this one immediately deduces that, within the semiclassical approximation, 
there is a critical value of « N?/? for stability of the system. In other words, the 
critical value of N, which we denote by N,, equals a constant times Kk 3/? in 
agreement with our bounds in Theorem 13.1. 

There are two questions one would like to ask. 


¢ Does the quantum-mechanical problem (13.3.1) lead to this semiclassical 
minimization problem to leading order as « > 0 and N > oo, with N«*/? 
fixed? 

¢ Is there a minimizing o for °°! and is it unique (up to translations, of 
course)? 


It is easy to show that if a minimizing @ for €°'**<*! exists, it satisfies the 
variational equation 


oP ay = (13.3.3) 


76G)=\% / 
|x 
R3 


where [- ]4 denotes the positive part, and jz > 0 is some constant, adjusted so 
that f o(x)dx = N. 

The answer to both these questions is affirmative, as proved by Lieb and Yau 
in [137]. The most difficult part in the proof of the correctness of E¢!*ssi*@l is 
the lower bound on the ground state energy of H in (13.3.1), and this follows 
the method in our Theorem 13.1 except that the estimates are done much more 
carefully in [137]. 


-- 
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With the aid of standard techniques in the calculus of variations it is not hard 
to prove that there is a minimizer of €*'°*'**! as long as N < N;. The proof of 
the uniqueness of a solution of the equation (13.3.3) for N < N, is somewhat 
non-standard and turns out to be rather involved.! When N = JN, there is also a 
minimizer of €*°**! if we set m = 0, i.e., replace j(t) by (3/4)(627/q)!/714/?. 
The solution of the corresponding variational equation (known as the Lane— 
Emden equation) turns out to be unique up to translations and rescaling; that is, 
one can replace o(x) by AXe(Ax + y) for any 4 > O and y € R*. 


13.3.2 Relativistic Gravitating Bosons 


The situation for bosons is similar to the fermionic case discussed in the previous 
subsection. The effective functional for small « and large N is now the Hartree 
functional 


ellanmee (9) — ( Te: (Vip? +m _ m) Je) ~ KD(o, 0). (13.3.4) 


It results by assuming that all the particles are in the same one-particle state 
d(x) = /o(x)/N,’ compare with Eq. (13.2.2). That is, the many-body wave 
function is of the form w(x,,...,Xy) = Bee , ?(x;). The Hartree energy, which 
is the infimum of (13.3.4) over all non-negative @ with Te o(x)dx = N, has the 
scaling property 


peHaniee( K, m) _— N Ebay KN, mN~'). 


This shows that the critical value of N, denoted again by N,, now equals a 
constant times «~!, in accordance with our bounds in Theorem 13.1. 

It was proved in [137] that the Hartree functional (13.3.4) correctly describes 
the ground state energy of (13.3.1) for bosons in the limit k + 0 and N — oo 


Equation (13.3.3) is treated in many physics textbooks, see, e.g., [30] or [186, Sect. II.3.4], 
by converting it into a second order differential equation in the radial variable |x|. It is an 
elementary fact that the resulting equation has a unique solution if the value of @ at the origin 
is specified, and textbooks tend to leave it at that. The difficult part of the proof in [137] is to 
show that there can only be one such value for a given N. 

It is not a restriction to assume that @ is non-negative, as we do here, since the representa- 
tion (3.2.17) shows that replacing @ by || can only lower the kinetic energy. 
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with «N fixed. Moreover, there is a minimizing ¢(x) = /o(x) for €#*"t*e as 
long as N < N,, which satisfies the Hartree equation 


2 
(Vipie +m? — m) ox) —« f POM px) = —nper. 


; Ix — y| 
R 


Uniqueness of solutions to this equation is still an open problem, in general. For 
small values of x f |\@(x)|?dx uniqueness was proved by Lenzmann [104].° 


13.3.3. Inclusion of Coulomb Forces 


In Section 13.2 we obtained bounds on the critical number of particles for 
collapse of a gravitating system of either bosons or fermions. In the model con- 
sidered there, there was only one kind of particle and the gravitational attraction 
among them was proportional to k = Gm?, where m is the mass of the parti- 
cle. This model is appropriate for a neutron star (neglecting general relativistic 
effects and nuclear forces, etc), in which case m equals the mass of a neutron 
my, Which is approximately equal to the mass of a proton, namely ~ 2000 in 
our units (where the electron mass equals 1). The main conclusion was that the 
critical particle number N, is proportional to x~*/? since neutrons are fermions. 

For the physical values of m, and G the resulting critical particle number is 
roughly N. 10°’, and the mass of the neutron star, which is N,m,, is of the 
same order as the mass of our sun, but the neutron star is much smaller in size 
(by a factor of 10°). The main reason for this size difference, as mentioned in 
Section 13.1, is the lack of electrons in neutron stars, which are very light but 
produce a big pressure. If neutrons were bosons, N, would be proportional to 
«~', in which case N, © 1038, and N.m, * the mass of a mountain instead of 
the mass of a star. 

White dwarfs, on the other hand, can be thought of as consisting of a gas of 
nuclei and electrons, and hence not simply of one species of electrically neutral 
gravitating particles. The main gravitational energy in a white dwarf comes from 
the nuclei, since they are much heavier than the electrons. The Pauli principle for 
the electrons is responsible for the pressure that keeps the star from collapsing, 
however, even in case the nuclei are bosons. An approximate model to consider 


> For the non-relativistic analogue of the Hartree equation, existence and uniqueness of solutions 
was proved in [109]. 
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is the Hamiltonian (13.2.1) for the electrons, but with k = G(u/Z)*, however, 
where jz and Z are the mass and charge of the nuclei. While this model is the one 
usually employed, it ignores the fact that there are really two kinds of charged 
particles which make different contributions to the energy. 

In this section we show that a model in which there are electrons and nuclei 
with both electrostatic and gravitational interactions is stable in essentially the 
same parameter region as the simpler Hamiltonian (13.2.1) for just one species 
of (neutral) particles, with effective coupling constant that is proportional to the 
square of the mass per unit charge of the heavier particles. The presentation is 
similar to the one in [136]. No attempt is made to evaluate the precise constants 
as we did earlier. 

The Hamiltonian for this model is given by 


M 


#=32(Jioe-tm—m) + 3° (VPP ea») 


i=1 k=1 


1<i<j<N 


(13.3.5) 


where P;, and R; denote the momenta and positions of the nuclei, which could 
either be bosons or fermions. The charge and mass of the nuclei are Z and jp, 
respectively. The electron mass m = 1 in our units, but for transparency we shall 
retain it in our notation. 

For stability, we have to assume that Za + Gm < 2/7, which, for physical 
values, is satisfied for all nuclei with Z < 87. 

We shall also assume in the following that Z*a > Gy”. This condition is 
amply satisfied in practice, since Gu?/(Z7a) ~ 10~* for all nuclei in the peri- 
odic table. (In fact, j4/Z is roughly constant throughout the periodic table, since 
atomic nuclei contain roughly as many neutrons as protons.) 

For convenience, we define an effective nuclear charge to be 

2 
Zp =4{ Z2 - ela 
a 
and an effective electron charge to be 

Za+Gmu 

7 Zpa ; 


e 
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Then H can be written as 


N M 
aZ—K aZ; aZ.Zp 
2 a Ry a te — Ri 
l<i<j<n ! J 1<k<l<M i=l k=1 "' 
(13.3.6) 
with 
=4 
Ly? Gu? 
a5G ( =) ee 
. ae ( aa) 
In practice, 2000 < w/(Zm) < 6000, and hence x © Gy?/Z?. 
We choose some 0 < 7 < | and write H = H© + H®, where 
N M 
cH es ler +> (ViPiP +H - x) 
i=1 k=1 
aZ? Ae WE, 
ae z 2) Deer 7 
jy a or 1<k<l<M [Ri — Ril i=1 k=l |x; — Re 
(135.7) 


and 


N 
K 
Ho=0S'(JinPtm—m)- Pe 
i=1 lee — 2; I 


l<i<j<N 


The Hamiltonian H© contains only Coulomb forces. For appropriate values of 
n, a, Ze. and Z, it is bounded from below by —(1 — 7)Nm, as was shown 
in Chapter 8, Theorem 8.1. The nuclear kinetic energy is not needed for 
this bound, and the statistics of the nuclei is irrelevant. If the nuclei hap- 
pen to be fermions, the conclusion would hold even for 7 = 1 if Z is not 
too big, by simply exchanging the role of the nuclei and the electrons in 
Theorem 8.1. 

The remaining part H® contains only gravitating particles; it is equal to 
the Hamiltonian studied in Section 13.2, with an effective coupling constant 
«/n. In particular, this shows that the full Hamiltonian H in (13.3.5) is sta- 
ble as long as x < const. N~*/? (for appropriate values of Z and a). This 
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result holds for fermionic electrons and nuclei of either statistics, boson or 
fermion. 

It remains an open problem to study the semiclassical limit N, M — oo, 
k — Owith Nx*/ fixed. In this limit, one expects the system to be described by 
a semiclassical functional, which is more complicated than (13.3.2) because 
of exchange terms resulting from the Coulomb interaction potentials (see 
Chapter 6). 


CHAPTER 14 


The Thermodynamic Limit for 
Coulomb Systems 


14.1 Introduction 


In the previous chapters we established the fact that the ground state energy is 
bounded below by a constant times the total particle number for a variety of 
models of particles interacting via electrostatic and magnetic forces. The natural 
next question would be whether it is strictly proportional to the particle number 
for large particle number, that is, whether the limit of the energy per particle 
exists. For the ground state energy it is actually easy to see that this is the case, 
as we shall demonstrate in Section 14.2. 

A more interesting question is what happens if we confine a large number of 
particles to a large box, with the number of particles per unit volume, i.e., the 
density g, fixed. This would describe, for instance, a gas or a liquid or even a solid 
in a container. Again we expect the energy, in the limit of large system size, to 
be equal to the particle number times some function of the density, independent 
of the volume or the shape of the box. To make things even more realistic, one 
can discuss the very same question at a positive temperature T > 0, in which 
case the relevant quantity to look at is the free energy, i.e., the energy minus T 
times the entropy. This general question of the existence of the thermodynamic 
limit will be addressed in Section 14.3. 

In the previous proofs of the stability of matter the main concern was the short 
distance |x|~! singularity of the Coulomb potential, and to show that it does not 
cause the system to implode and have a very negative energy that grows faster 
than the particle number. In contrast, it is the long distance |x|~! nature of the 
Coulomb potential that is the source of difficulty for proving the existence of a 
thermodynamic limit. By confining the particles to a box they are prevented from 
escaping from each other, and the danger is that the energy could be positive 
and grow faster than the particle number. For instance, if we put only negatively 
charged particles in a box of diameter L, the particles would stick to the walls 
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of the container, and the energy would grow like N?/L = N*/30'/? for large N 
and fixed g. Clearly, charge neutrality will be an essential input here, whereas 
it was not before. 

Another concern might be that even if the energy is bounded from above and 
below by some constants times the particle number, the energy per particle still 
might oscillate and not converge as N goes to infinity. Although this is physically 
ridiculous it is a mathematical possibility that has to be addressed. 

In the physically realistic case there are several kinds of particles to be con- 
sidered. While there is only one negative particle of importance (the electron’), 
there are several kinds of nuclei, some of which might be bosons and some 
fermions. To simplify the discussion here we shall just take one kind of nucleus 
of charge +Ze and mass jz. We emphasize, however, that the proof given here 
can easily be generalized to include several species of particles, see [116]. 

The discussion begins with the thermodynamic limit for particles of inde- 
terminate density in their ground state, i.e., with no confining box and free to 
be where they like in R*. This would be appropriate for the discussion of a 
solid at very low temperature. Following that we shall give a quick review of 
the essential principles of statistical mechanics needed for the formulation of 
the thermodynamic limit at positive temperature and fixed density. The main 
theorem concerns the existence of the thermodynamic limit of the free energy of 
neutral Coulomb systems. The proof we shall give follows closely the original 
work by Lebowitz and Lieb [103, 116]. Only the Hamiltonians without magnetic 
field will be discussed. We remark that the corresponding question for matter 
interacting with the quantized electromagnetic field is still an open problem. 
Partial results in that direction were obtained in [121]. In Section 14.7 the ‘jel- 
lium’ model is briefly discussed. In this model the nuclei are replaced by a fixed, 
uniformly charged background, chosen to make the system charge neutral. 

The astute reader will notice an important fact about all the systems under 
consideration. Quantum mechanics plays no role in the proof except to provide 
a lower bound to the energy that satisfies stability of the second kind. Thus, the 
proofs go through for classical (i.e., non-quantum) statistical mechanics if there 
is such a lower bound. Indeed, in the jellium model this is the case. We shall 
briefly explain classical statistical mechanics in Section 14.4. For the model with 
electrons and nuclei considered in the rest of this book, quantum mechanics is 


' There are other kinds of negative fermions, like the muon, but they are unstable and have a very 
short lifetime. 
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not needed, as shown by Onsager [145], if we impose a hard core condition that 
|x; — R;| => a > 0 for alli, 7. Quantum mechanics does have an effect on the 
final energy, however. 


14.2. Thermodynamic Limit of the Ground State Energy 


Before considering the general problem of the thermodynamic limit for a given 
density and temperature we first consider the simpler problem of the ground state 
energy in an infinite volume. In the previous chapters, we have shown that this 
ground state energy is bounded from below by a constant times the total particle 
number. Here, we shall supplement this result by showing that the ground state 
energy per particle actually has a limit as the number of particles goes to infinity. 

In contrast to the general case discussed in the next section, charge neutrality 
is not needed here. It was also irrelevant for the question of stability discussed 
in the previous chapters. 

For simplicity, we shall only consider systems without magnetic fields. We 
shall also restrict our attention to just one species of nuclei. The extension to 
more general situations is straightforward. 

The Hamiltonian under consideration will be the same as in Chapters 7 and 8, 
except that we allow for a finite nuclear mass and also take the nuclear kinetic 
energy into account. It is given by 


N M 
H =) °T(p;)+ >, T.(Pj) + aVc(X, R). (14.2.1) 


i=l j=l 


The electrons have positions x; and momenta p,;, while the nuclei have positions 
R ; and momenta P ;. They can be either bosons or fermions. The kinetic energy 
of the electrons is either of the form 


2 
Pp / 
Tn(P) = am or Tn (Pp) = Pp’ =f m?—m 


withm > 0. The same choices apply to the kinetic T,,(P) of the nuclei, with mass 
instead of mass m. A special choice is 44 = oo, which means that the nuclei do 
not move, but are fixed in some minimum energy configuration. The Coulomb 
potential V-(X, R) is defined in (2.1.21), with Z; = Z for j = 1,2,..., M. 
Let Eo(N, M) denote the ground state energy of (14.2.1). We assume, as 
usual, that the electrons are fermions. If they were bosons the ground state 
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energy would grow as —N’/°, which implies that the thermodynamic limit 
exists for bosons only in the sense that the energy per particle is —oo in this 
limit. Real electrons have g = 2, but our conclusions hold for any fixed, finite 
q. In the relativistic case, we shall assume that Za and @ are appropriately 
chosen for stability of matter to hold. Sufficient conditions for this are given in 
Theorem 8.1. 


Theorem 14.1 (Thermodynamic Limit of the Ground State Energy). Let 
Eo(N, M) denote the ground state energy of (14.2.1). Then Eo is subadditive, 
that is 


E(M1 + No, M1 + M2) < E(M, M1) + E(N2, Mz). (14.2.2) 


Now let N; and M; for j = 1,2,... be a sequence such that N; + Mj; — oo 
as j — © and such that N;/(N; + Mj) converges to some number n with 
0 <n < 1. Then there is a function e(n) defined on (0, 1) such that 


Eo(Nj, M; 
lity Oe 


e(n), 14.2.3 
Se Nae (n) ( ) 


regardless of the sequences N; and M; (provided N;/(Nj; + Mj) — n). More- 
over e is bounded and convex on (0, 1), i.e., 


e(Any +(1 —A)n2) < Ae(M1) + A — AJe(N2) (14.2.4) 
forO<A<1. 


Proof. The subadditivity (14.2.2) is a simple consequence of our being able 
to use, as a variational function for the problem with N; + N> electrons and 
M, + M2 nuclei, a product function of the form y! ® wre, where y! is afunction 
for the Nj electrons and M, nuclei, and similarly for Ww. The subscript y € R? 
indicates a translation of w?, ice. Wy(e1, X2,... )=wW(xrtyxot+y,...). 
The expectation value of the Hamiltonian H converges, as y > o, to 
(w!, Hw!) + (W?, Hw’). There is a small technical problem associated with 
the product function y! @ Wy. namely it will not necessarily have the right 
permutation symmetry properties. This however can easily be corrected by 
appropriately symmetrizing or antisymmetrizing this function, which will not 
affect the limiting energy as | y| — oo. 

The assertion (14.2.3) is a direct consequence of (14.2.2) together with the 
stability bound E9(N, M) > ACN + M). To prove it we first note that there is 
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also an upper bound of the form E9(NV, M) < B(N + M) for some finite number 
B. In our case we can obviously take B = 0 but this upper bound can be deduced 
from (14.2.2) alone by noting that Eo(V, M) < NE((1, 0) + ME, 1). 

Since Eo(N, M)/(N + M) is bounded, there are sequences (by passing to 
subsequences if necessary) so that the limit in (14.2.3) exists. Now suppose that 
there are two sequences (N;, M;) and (N’ ',M Ay both with the same limiting 7, 
such that lim j—..9 Eo(N;, Mj IN; +Mj)< lim +00 EQ(N', Mi)/(N; + M’,). 
By going to a subsequence of (N%,, M; ;) and appropriately relaneling: the ie: 
ments, we can assume that 


N; => L;N; Vj 


for some sequence of integers L; that goes to +00 as j > ov, andr; < Nj. If 
we write 


Mj = LjNj +3; 


then lim jo. 6;/(L;Nj) = 0 (since both N;/M; and N;/M; converge to the 
same ratio). By subadditivity and the upper bound on Eo, 


E0(N;, M;) < Lj Eo(Nj, Mj) + Br; + 4)) 
and hence 
Eo(Ni, M’) Eo(N;, Mj 
lim —_ + < lim Bony, 

This contradicts the assumption that the left side be strictly bigger than the right 
side, and hence the two limits agree. 

As for the convexity, we shall prove weak convexity, namely e(4m + +12) < 
Se(n1) + Se(n2). This, together with the boundedness of e(7) easily implies 


convexity (cf. [86, Thm. 1.11]). Let (N;, Mj) and (N?, M;) be sequences con- 
verging to 7; and 72, respectively. By (14.2.2), 


Ey(Nj(N; + M;) + N7(N; + Mj), Mj(N; ae M;) oe M;(N; +M;)) 
< (N} + M?)Eo(N;, Mj) + (N} + Mj)Eo(N?, M3). 


After we divide this inequality by 2(V j + M j )N ; + M;), the left side converges 
to e(sn + 512); and the right side converges to 5e(m) + 5e(12). | 


252 Thermodynamic Limit for Coulomb Systems 


14.3 Introduction to Quantum Statistical Mechanics and 
the Thermodynamic Limit 


In the previous section we have discussed the existence of the thermodynamic 
limit of the ground state energy of systems in infinite volume. We will now turn 
our attention to systems that are confined to finite volumes and have positive 
temperature. The Hamiltonian under consideration will be the same as in the 
previous section. We shall consider only the neutral case here, i.e., N = MZ. 

In order to define an operator restricted to functions that are supported on 
some set Q C R?, we consider the quadratic form? 


N M 


eo) = f [2 ta2rk) + > 1.2K) | [Pk, KPdkaK 


i=1 j=l 


+ a | Vc(X, R)|W(X, R)/?dX dR (14.3.1) 


for w € H'(R°“+™)) (in the non-relativistic case) or w € H!/7(R°4t+”) (in 
the relativistic case) with the property that y~(X, R) = 0 if either x; ¢ Q for 
some i or R; ¢ Q for some j. 

Under the assumption that E(w) > —C(v, w) for all w, ie., stability of the 
first kind, there is a standard method (the Friedrichs extension) of defining 
a corresponding operator Hg such that €(W) = (WwW, He wv). We refer to [150, 
Thm. X.23] for details. In the non-relativistic case, the operator Hg is simply 
given by (14.2.1) with Dirichlet boundary conditions on the boundary of Q, 
i.e., y = 0 on the boundary. Stability of the first kind always holds in the non- 
relativistic case (Theorem 7.1) and it holds in the relativistic case as long as a 
and Za are not too large, as proved in Theorem 8.1 in Chapter 8. 

For bounded Q, the operator Hg has discrete spectrum and a complete set 
of eigenfunctions.’ Moreover, the eigenvalues of Hg (including multiplicity) 
increase fast enough so that e~*”® is trace-class for any 6 > 0, that is, we can 


? We shall slightly differ from the notation in previous chapters and use k for (k,,..., ky) and 
K for (K,,..., Ky). 

3 Spectral theory is not really needed here. We can define the partition function also by a variational 
principle, see Remark 14.2. 
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define the partition function Z(8, Q, N, M) as 


Z8,O.N Mya Te = yeh, (14.3.2) 
E 


where the sum runs over all eigenvalues EF of Hg in the appropriate symmetry 
class (Bose or Fermi). The positive parameter 6 equals 1/kg7, where kg equals 
Boltzmann’s constant and 7 > 0 is the temperature. 


Remark 14.1 (Spin and Symmetry). If the electrons have g spin states then 
the electron coordinates should really be z = (x, o) as before. The spin plays no 
essential role, however, and there is the following easy way to think about spin 
in order to be able to concentrate only on the spatial variables x;. Just pretend 
that there are g kinds of electrons and that each kind has g = 1. That is, there are 
N’ particles of type i (with i = 1,...,q) and the allowed wave functions are 
functions of N', N?,..., N@ spatial variables that are separately antisymmetric 
in each of the N’ variables. We can compute the partition function with given 
numbers N!,..., N4 of particles of each type. After that the partition function 
given in (14.3.2) is computed by summing over all choices of the N’ such 
that )~7_, N' = N. No combinatorial coefficients are needed — as there would 
be if one tried to compute the partition function by summing over the various 
symmetry types of the x variables that can arise from antisymmetry in the (x, a) 
variables. We thus see that it is only necessary to think of g species of ‘spinless’ 
fermions. For simplicity of notation we shall restrict our attention to g = 1 in 
the sequel. 


The following variational principle is essential for our proof of the thermody- 
namic limit [153, Prop. 2.5.4]. 


Lemma 14.1 (Variational Principle for the Partition Function). Let H be 
a self-adjoint operator that is bounded from below, with e~°" trace class. Let 
{Ur,, Wo...} denote a set (finite or infinite) of orthonormal functions. Then 


Tr e~P4 > YS exp[—B(i, H Wi)]- 


In particular, Tr eg 2 equals the supremum of x exp[—6(w;, H w;)] over all 
(finite) sets of orthonormal functions. 
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Proof. By convexity of the exponential function, (Ww, e~?" w) > exp[—B(y, 
Hw)] for any normalized function yw. To see this, let E; and ¢; denote the 
eigenvalues and corresponding orthonormal eigenfunctions of H. Then 


(PW) = Soe PP Ig;, WP = exp -» >> Edi, wr | 


= exp[—B(v, H w)], 


where we have used Jensen’s inequality’ for the convex exponential function, 
as well as >>; |(¢;, W)I? = (W, W) = 1. Since Tr e~8# > > .(y;, e- F4 Wi) if the 
w; are orthonormal (because the trace of a trace-class operator can be computed 
in any orthonormal basis), the claim is proved. a 


Remark 14.2. The variational principle for the partition function can be written 
as 


Tr e 4 = sup S “exp [—BEWi)] 


where the supremum is over all (finite) sets of orthonormal functions. Hence 
it is not necessary to introduce the Hamiltonian operator in order to define the 
partition function. The Hermitian quadratic form €(y) (defined in (14.3.1) for 
the model under consideration) suffices. 


The free energy for a finite domain Q is defined as 


F(B,Q,N,M):= ~5 In Z(6, 2, NM), (14.3.3) 


The specific free energy is the free energy per unit volume, i.e., 
f(B, 2, N, M) := F(B, 2, N, M)/|Q|. (14.3.4) 


Here and in the following, | - | denote the volume or Lebesgue measure of a set 
in R’. 

For the Hamiltonians (14.2.1), the specific free energy is bounded from below, 
independent of the shape or the volume of the domain &. (In the relativistic case, 


+ See the footnote on page 135. 
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we have to assume that Za < 2/m and a is small enough.) This can be seen 
from the following consideration. From the stability of matter, Theorem 7.1 in 
the non-relativistic case and Theorem 8.1 in the relativistic case, we see that, for 
0O<e<l, 


N 
(1 =e) ¥°Tn(p)) + aVc(X, R) = —C(N + M) 
i=l 

for some C > 0 and, in the relativistic case, for ¢ small enough. Hence H > 
€ alr Tin(p;) + eS T,,(P ;) — C(N + M). In particular, from the variational 
principle stated in Lemma 14.1, Z < e&%+CTr exp[—Be >”, Tn(p;) — 
B So T,,(P ;)]. Itis easy to see that the latter trace is bounded from above by 
e?\N+™) for some D > 0, see [153] or any other standard textbook on statistical 
mechanics. 

The thermodynamic limit concerns the behavior of f(6,Q,N,M) as Q 
tends to R? in a suitable sense, and N and M tend to infinity as well, with 
N/|Q2| and M/|Q| converging to a finite non-zero value. In order to define what 
is meant by ‘suitable sense’, we need the following definition. 


Definition 14.1 (Regular Sequence). If 2 C R¢@ is an open set andh > 0, define 
V (h; 2) to be the measure of the set of points x € Q such that the distance of x 
to Q°, the complement of (2, is less than or equal to h. 

Forh < 0, let V(A; 2) denote the measure of the points y € Q° whose distance 
to Q is less than or equal to —A. 

A sequence of bounded open sets Q; C R¢, 7 €N, is said to be a regular 
sequence if the following two conditions hold. 


(1) The sequence is a Van Hove sequence, i.e., limj_,. |{&2;| = oo and 
lim joo V(h; 82;)/|82;| = 0 for all fixed h € R. 

(2) The sequence satisfies the ball condition, i.e., if B; is the smallest ball 
containing Q; then lim inf jo |Qj|/|Bj| > 0. 


The following theorem, which is the main result of this chapter, shows that 
for a regular sequence of domains, the thermodynamic limit of the specific free 
energy exists, and is independent of the sequence. It was proved in [103, 116]. 


Theorem 14.2 (Thermodynamic Limit of the Free Energy). Fix some B > 0 
and @ > 0, and consider a regular sequence of domains Q2; € R¢, anda sequence 
of integers N; such that N;/|Qj;| > @ as j > o. With M; = N;/Z(i.e., in the 
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neutral case), the limit 


fim f(6, 27, Nj, My) = f(B,@) (14.3.5) 


exists and is finite. The limit function f is independent of the sequence. It is a 
convex function of @ and a concave function of 1/. 


The convexity/concavity is important for it is the basis of thermodynamic 
stability. Convexity in @ means that the pressure (defined to be — f + gd f/dQ) 
is anon-decreasing function of 9. (Squeezing a container increases the pressure.) 
If this were not true the system would spontaneously implode. 

Likewise, concavity in T = 1/(kg 6) means that the temperature increases as 
the energy per unit volume, defined to be f — Tdf/0T, increases. If this were 
not so then hot bodies would suck energy away from cold bodies. 


Remark 14.3 (Spin and Symmetry, redux). As explained in Remark 14.1, spin 
can be taken into account by considering the specific free energy of g species of 
‘spinless electrons’, which will then be a function of g densities g' = N'/|Q|, 
i=1,...,q.Inthe thermodynamic limit, this function will be a jointly convex? 
function of (@', ..., @4), and this can be proved in a similar way as convexity of 
Ff(B, @) in Q in Theorem 14.2. It is also obviously a symmetric function of the q 
variables a, ..., 7, which we denote by FB, e. ...,Q7). Itis easy to see that 


FPO) S22 FBO 000"): 
bi e'=e 


This follows from the fact that in the sum of the partition functions over different 
ways of splitting N into q integers, only the largest summand survives in the 
thermodynamic limit. In fact, the error that we make in f when considering only 
the largest contribution is at most g(B|Q|)~!In N, since there are less than N4 
summands. In particular, from convexity and symmetry it follows that 


f(B. oe) = f(B, 0/9, ---, 0/4). 


Several generalizations of Theorem 14.2 are possible, as shown in [116], but 
we shall not give the details here. These concern the following. 


> A function of q variables is jointly convex if f(Ae' + —A)ul,..., Aol + (1 A)p4) < 
RO s 40%; OLA) PO ssa HM!) forO<A <1. 
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e Non-neutral systems. The system does not have to be strictly neutral, but 
the net charge has to be small enough. More precisely, as long as the net 
charge Q is much less than N7/?, the resulting electrostatic energy does not 
contribute in the thermodynamic limit. If Q = ON’? it is necessary to make 
the sequence of domains all have the same shape, e.g., a ball or an ellipsoid. 
Then the limiting specific free energy equals f(B, 0) + e4/? G7 /(2C) where 
C is the electrostatic capacity [118, Sect. 11.15] for the domain of the given 
shape and with unit volume. If @/N*/? — oo then the specific free energy is 
+oo in the thermodynamic limit. 

¢ Multiple species. Instead of having just one species of negative fermions and 
one species of positive nuclei, multiple species can be accommodated. For 
stability it is important that either all the negative or all the positive particles 
be fermions. 

e Short-range forces. In addition to the Coulomb forces among the particles, 
one can include short-range potentials, meaning potentials of the form V(x; — 
x ;) which are integrable at infinity. They may or may not include hard cores. 
In this way one could take ionized atoms as the fundamental particles, or take 
explicitly the form factor of the nuclei into account. 


Historically, the problem of the thermodynamic limit was studied mainly for 
systems with purely short range forces, such as hard sphere particles. Many 
names are associated with this development, including Onsager, Van Hove, 
Bogoliubov, Lee, Yang, van Kampen, Wils, Mazur, van der Linden, Griffiths, 
Dobrushin, Sinai and especially Ruelle [153] and Fisher [65], who evolved what 
came to be the canonical technique for proving the existence of the thermody- 
namic limit in quantum and classical statistical mechanics. The main idea is 
to prove that the specific free energy decreases with increasing volume, except 
for terms of lower order. The thermodynamic limit is then established because 
decreasing sequences have limits. Stability of the second kind makes its appear- 
ance in the assertion that this limit is finite because the sequence is bounded 
below. 

The canonical technique in which big cubic boxes were packed with smaller 
cubic boxes would not work for the long range Coulomb force. The neces- 
sary modification was given in [116] where cubes were replaced by balls, 
and the long-range interaction was ameliorated by the electrostatic screen- 
ing given by Newton’s theorem, Theorem 5.2. The replacement of cubes by 
balls led to the geometric problem of how to pack large balls by smaller balls 
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efficiently. The solution of the problem is presented in Section 14.5. Apart 
from this change from cubes to balls, and the resulting analytic complication 
this change induces, the basic strategy remains that of the canonical method 
mentioned above. 

Recently an alternative proof of the thermodynamic limit (in the grand- 
canonical ensemble) for Coulomb systems was constructed by Hainzl, Lewin 
and Solovej [83]. The strategy there is different from the canonical approach; it is 
shown that the specific free energy increases with volume, except for negligible 
terms. 

The proof of Theorem 14.2 will be given in Section 14.6. It will make essential 
use of the ‘cheese theorem’ discussed in Section 14.5. Before explaining the 
proof of Theorem 14.2 let us remind the reader of the definition of the partition 
function in classical statistical mechanics. 


14.4 A Brief Discussion of Classical Statistical Mechanics 


As explained in Section 2.1, a classical mechanical system is described by 
the Hamiltonian, which is now a real-valued function of the various momenta 
P=(p,,..., py) and positions X = (x,,..., x,y) of the particles. If there are 
N/ particles of species j, with N = a N/, the classical partition function for 
particles in a domain Q is 


1 
TT, Ni! 


e fF =7= 


ho’ | dP / dx ¢ PHLD. (14.4.1) 
Qn 


R3V 


where h is Planck’s constant. It plays the role of a normalization constant which 
makes Z dimensionless. Formula (14.4.1) is suggested by taking a semiclassical 
approximation of the quantum partition function (compare with the discussion 
in Section 4.1.1). The factorial factors result from the quantum-mechanical 
restriction to either symmetric or antisymmetric functions (bosons or fermions) 
which reflects the indistinguishability of particles within the same species. 

If H is of the form )°; p;/(2m;) + V(X), then the P integration can be 
carried out and one obtains 


2mm; \*” 
Z= I] (40) a) (14.4.2) 
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where 


‘conf 1 
Q =e PIO : faxeorm (14.4.3) 
TL 4! 


Qn 


is called the configurational partition function,° and f° is the configura- 
tional part of the free energy per unit volume. 

For an ideal gas, where V=0, Q= |Q\/ I; N/!, and hence, using 
Stirling’s formula, the specific free energy F'/|Q2| becomes 


_ 1 i B7/7h3 9) 
Fade’ (m (Qaemi ') 


in the thermodynamic limit, where m/ denotes the mass of the particles of species 
j, and g/ is the density of these particles. For interacting systems, the existence 
of the thermodynamic limit of the classical specific free energy f = F'/|Q| is 
equivalent to the existence of the thermodynamic limit of the configurational 
part fo, 

Let us now consider the Coulomb systems discussed in the rest of this book, 
where the interaction potential is given in (2.1.21). For such systems, Q = +00, 
because of the attractive 1/|x| potential between electrons and nuclei. This 
means that classically there is no stability of the first kind. On the other hand, 
if we modify the potential slightly stability of the second kind can be obtained. 
The way to do this was discovered by Onsager [145] who introduced a hard- 
core condition. Let us imagine that there is some fixed radius a > 0 such that 
V = +00 unless |x; — R;| = a for alli, 7. For this V, Q is finite and stability 
of the second kind holds. This can be easily seen as follows. If |x; — R;| =a 
for all i and j, we can replace the point charges at x; and R; by smeared out 
charges uniformly distributed over spheres of radius a/2. The electron—nucleus 
attraction does not change under this replacement, whereas the interelectron and 
internuclear repulsions decrease in case the particles are closer to each other 
than a distance a. The total Coulomb energy (including the self-energy of all 
the particles) is positive. Therefore V is bounded below by the negative of the 
self-energy of the charge distributions, namely —(N + M)/a. 


© The configurational partition function is often defined with a prefactor |Q|~" or 1 instead of 


1/7]; Ni. 
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The existence of the thermodynamic limit for this classical hard-core Coulomb 
gas can be shown in essentially the same way as the one in quantum case, which 
we shall discuss in the sequel. To make the following proof work without 
modifications one has to introduce also a hard-core constraint with respect to 
the boundary of (; i.e., the electrons and nuclei have to stay away a distance 
a/2 from the boundary of Q. This does not affect the value of the specific free 
energy in the thermodynamic limit, of course. 

Another Coulomb system for which stability of the second kind holds both 
classically and quantum mechanically is the jellium model, which we shall 
discuss in Section 14.7. There we shall also explain in some detail the differences 
between the proofs in the classical and quantum cases. 


14.5 The Cheese Theorem 


Our goal in this section is to prove that one can find a sequence of balls B;, 
j=0,1,2..., in R?¢ of geometrically increasing radii, Rj = (1 + py with 
p €N, such that every ball B; can be filled with ny = pid + py disjoint 
copies of balls of radius R;_,, with k = 1,2,..., j. If we can do this then the 
filling of the j ball will be exponentially quick. More precisely, if Vj = 04 R’ 
denotes the volume of B; then the unfilled volume fraction of B; is 


Jj J d 
_ _ _1) { Rj-x 
LY, : ) ny Vj-~ = 1— y pela + py (= ) 
k=1 k=1 J 


J J 
Ss = _ = P 
k=1 | + P 


(14.5.1) 


This sequence of balls will be called the standard sequence and will be used 
not only to define a thermodynamic limit but also as a standard to show that an 
arbitrary sequence of suitably behaved domains has the same thermodynamic 
limit. 

The next theorem shows that this sequence can be constructed if p is large 
enough (p > 9 ford = 2 and p > 26 for d = 3). For convenience of exposition 
we reverse the order by starting with a ball of radius 1 and then filling it with 
balls of radii (1 + p)~/ for all j > 1. The theorem has come to be known as the 
‘cheese theorem’ because it shows that a ball of unit radius can have its volume 
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annihilated by carving out balls of ever smaller radius. After a few balls have 
been cut out, the remaining domain looks like Swiss Emmenthaler cheese. 


Theorem 14.3 (Cheese Theorem). Let p be a positive integer satisfying 
14 ps0" 120/40 4-246 (14.5.2) 
(where og denotes the volume of the unit ball in R¢). Let 
rj=(+ py? and nj= pi td + py@, 


It is possible to pack the unit ball in R4 with the disjoint union Uji 
(n; balls of radius r;). It is also possible to pack the cube in R? of volume 
oq with the same Set of balls. 


Proof. The fundamental observation is this: Recall the definition of V(h; Q) in 
Definition 14.1. Let AZ“ denote the hypercubic lattice in R@ with side length 
i. The points in this lattice define hypercubes in R¢ of volume A“, with the 
property that the vertices of these hypercubes are contained in the lattice. 

The number of hypercubes that are contained entirely in Q is at least as large 
as A~4(|Q| — V(Av/d; Q)), where |Q| is the measure of Q. To see this, note that 
the number of hypercubes entirely contained in Q equals A~“ times their total 
volume. The total volume occupied by these hypercubes is at least |(2| minus 
VAVd; Q). 

A simple corollary of this observation is that we can pack at least A~4(|Q| — 
VAVd; Q)) disjoint balls of diameter 4 in Q. 

We prove the theorem for the unit ball; the proof for packing the cube of 
volume oy is essentially the same. Starting with Q equal to the unit ball, we 
easily calculate V(h; Q) = og(1 — (1 — h)“) for0 < h < 1, whichis bounded by 
V(h;3Q) < og((1 +h)? — 1) < ogh(24 — 1). Setting A; = 2/(1 + p), we have 
to check that ny = (1 + p)*! is less than or equal to a7 7(1Q| — V(Aivd;Q)), 
which follows immediately from (14.5.2) using the upper bound on V just 
derived. 

The next step is to insert m2 balls of radius rz on a hypercubic lattice of size 
dy = 2/( + p)’, and use the ‘fundamental observation’ to make sure that they 
will fit. Having done so, we then insert 3 balls of radius r3 on a hypercubic 
lattice of size 43 = 2/(1 + p)* and so forth. At each stage we have to calculate 
VA; rae) j), Where 2; is the cheese left after removing the balls in the previous 
jJ — 1 steps. That is, Q; is of the form Q \ U; b;, where the 6; form a finite 
collection of disjoint balls, namely n; balls of radius 7, withk = 1,..., 7 — 1. 
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We shall overestimate this volume by adding up the individual contributions 
to V(A; Jd; Q2;) from Q and the balls ;, i.e., 


j-l 
VAivd; Qj) < VAjVd;2)+ nerf Vaid 71; Q). 


k=] 


As before, we use V(h;Q) < V(—h;Q) < ogh(24 — 1) for 0 <h < 1. This 
yields 


j-1 
VA; d; Qj) < o4(24 — DAjVd ( + Ynt) 


kel 


Jol 
= 04(24 — DajVd (1 ee r—) 
Pp — 


Moreover, |Qj| = oa1 — j=} ner) = oa(p/C. + p))i-!. The inequality 
nj <A5“(|Q;| — VAjvd; Q;)) thus holds if 


(apap 8 
1 <2oy (1+ p- 04 - Iva +P v ’). 
D a 
Since p > 2 and p!~/ < 1, this inequality holds if and only if it holds for j = 1, 
that is, 


12a, (1 pega 1)2vd) 
which holds true by the hypothesis (14.5.2). | 


As stated before, we want to consider a general sequence of domains and show 
that the thermodynamic limit of the free energy is the same as for the standard 
sequence. The following corollary of the cheese theorem will be important for 
this purpose. 


Corollary 14.1 (Filling General Domains by Balls). Let Q C R¢@ be a bounded 
open set. For p € N satisfying (14.5.2), let R; =(1 + p)! and nj= pita 
p)i@-)., Fix some J € N. Then the following two properties hold. 


(1) Let m denote the largest integer less than or equal to [|Q\| — V( + p)! 
o,!4 Jd; Q)]/(oa + p)!”). Assume that m > 0. Then Q contains the dis- 
joint union ele: (mny_j balls of radius Rj). 
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(2) Let B be any domain containing Q, and let A := B \ Q. Furthermore, 
let ut denote the largest integer less than or equal to [|A| -— V(-d + 
p)'o,/"Vd; Q)- Vd + p)'o,!*./a; B)\/(oa + p)!4). Assume — that 


ft > 0. Then A contains the disjoint union a nj—j; balls of radius Rj). 


The first part of the corollary says that as long as the surface to volume ratio of 
a domain is small, the domain can be packed very efficiently with balls from our 
standard sequence, in fact with the same efficiency as the unit ball can be packed 
with smaller balls. The second part says, in particular, that the complement of 
the domain that lies inside a circumscribed ball can also be packed in a similar 
fashion. 


Proof. By the ‘fundamental observation’ explained in the proof of Theorem 14.3, 
the domain Q contains m disjoint cubes of volume o4(1 + p)/“. According to 
the last statement in the Cheese Theorem 14.3, these can be packed with balls 
as stated. 

The second part of the Corollary is proved in exactly the same way, noting 
that V(h; A) < V(—A; Q) + V(h; B) for h > 0. | 


Corollary 14.1 will be applied to large domains (2 that are sufficiently regular 
in the sense of Definition 14.1. 


14.6 Proof of Theorem 14.2 


To avoid unenlightening complications, we shall assume that Z is an integer, as 
it is in nature. 


14.6.1 Proof for Special Sequences 


Pick an integer p for which the conclusion of the cheese theorem holds. For each 
fixed density 9 > 0 we choose a special sequence of domains (2; of increasing 
volume, with 7 = 0,1,.... This sequence will be the balls described in Sec- 
tion 14.5, except that we choose the radii such that the number of nuclei in the 
volumes are integers. Explicitly, (2; is a ball with volume |Q;| = ZU. + p)*!/e, 
and M; = (1+ p)*/ and N; = Z(1 + p)*/. We denote the radius of Qo by ro, 
ie., (47rg)/3 = Z/o. 

For j > 0, we decompose (2; into a finite set of balls according to The- 
orem 14.3. That is, Q; contains ny = ped + py balls of radius rj_,; = 
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(1+ p)-*ro with k = 1,..., j. Let 4, denote the fraction of the volume occu- 
pied by the balls of size rj_,, 1.e., 
3 k 
ry 1 
bam it = ( . ) (14.6.1) 
ri p\l+p 


Note that }°7° , Ag = 1. 
The fundamental inequality we would like to show is the following. 


Lemma 14.2 (Recursion Relation for the Free Energy). Let f; = f(, 
Q2;, Nj, M;) and let 4; be given in (14.6.1). Then 
J 
f7< OM fie + 00 + py Ajas (14.6.2) 
k=1 


for some constant c > 0 (depending only on @). 


Let us defer the proof of this lemma for the moment, and show how (14.6.2) 
implies the existence of the limit of f; as j — oo. Let d; be defined as 


j 
dj = fi—)- Anfi-n. (14.6.3) 
k=1 
Note that (14.6.2) says that dj < c(1+ py Aj+- The solution to the set of 
equations (14.6.3) is 
j-l 
fpaajty So dk. (14.6.4) 
k=0 
which the reader can easily verify by induction, starting with j = 1. If we write 
jrl j-l j-l 
Yo de = 0 [de — A + py? Acar] +0 + py Do Aes, 
k=0 k=0 k=0 
we can see that the limit as 7 — oo of this sum exists. In fact, the limit of 
the last sum on the right exists (and is finite), and the limit of the first sum 
on the right must exist since all the summands are non-positive. Moreover, it 
must be finite since f; is bounded from below, uniformly in j. This implies that 
lim j-.o0 dj = 0, and hence also f; in (14.6.4) has a limit, which is finite. It is at 
this point that the stability of matter enters to show that the limit is finite instead 
of —oo. 
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We are left with the task of proving (14.6.2), which is the heart of the matter 
and which is the reason for using balls as our fundamental domains. 


Proof of Lemma 14.2. We shall employ the variational principle for the parti- 
tion function in Lemma 14.1. The set of orthonormal functions will consist of 
functions W that are products of functions yw, supported entirely in individual 
balls, or in the complement of all the balls. The functions y, will be of the right 
symmetry type. Strictly speaking, Y has to be appropriately (anti)symmetrized 
among particles in different balls, but this will not have any effect, as we shall 
see. The key idea is to place the particles in the individual balls in such a way 
that the resulting state is invariant under rotations, in which case Newton’s the- 
orem (Theorem 5.2 in Chapter 5) will imply that there is no average interaction 
between different balls, as will be shown. If there were no interaction between 
balls then (14.6.2) would be an equality. 

We write Q; as the union of balls labeled by b= 1,2,..., and their com- 
plement in Q;, which we denote by I';. There are ny = p*~'(1 + p)** balls 
of radius rj_, = (1 + p)i—*ro. Let N” be equal to @ times the volume of the 
ball number b (not to be confused with N;), and M > — N°/Z. The number of 
particles in I’;, the complement of the balls, is then N” = N; — )0, N > and 
MY = NY/Z. 

For every ball b, let y? denote a set of orthonormal functions of N’ + M? 
particles supported in the ball b, with the correct permutation symmetry type. 
Let @ denote the multi-index a = (a!,a”,...). As N j-Particle wave functions 
on &2; we choose products of the form 


We=v" | [ ve. 
b 


where yy” is a fixed normalized function of N” + MY” particles supported in 
the complement of all the balls. Let S denote the operator that projects onto 
the correct permutation symmetry type for all the N; particles. We claim that 
(SWz, SWy) = 0ifa 4a’, and 

(SW5, H SW) 

(SW, SWa) 

for the Hamiltonian (14.2.1). This claim follows from the fact that a permutation 
(of either the electrons or the nuclei) either leaves all the particles in the same 


domain, or it must move at least two particles out of their original domain. In 
the latter case, the one-particle terms in the Hamiltonian have vanishing matrix 


= (Wy, H Wz) 
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elements between the original and the permuted function. The same is true for 
the two-particle terms, simply because they are local.’ (For the one-particle 
terms, locality is not needed, which is important in the relativistic case.) 

In particular, the variational principle in Lemma 14.1 implies that 


exp[—B|Qj|f;] = D— expl-B(Wa, H Va)I.- 


The expectation value of H can be written as® 


ee y ae 
(Ws, HW; = Wen HWA +5 > TO = edly 


2 De! 


¥ 
+’, Hy") +a x ff A (14.6.5) 


where Q? denote the charge density of w?, defined in Eq. (3.1.8) in Chapter 3, 
and likewise for Q”. We note that { Q2(x)dx = [ Q”(x)dx = 0 because of 
charge neutrality in all the balls. 

Jensen’s inequality’ implies that 


do expl-B(Wa, H Ya)] 


> PY Sexp|—B ove Hv) — Bw”, H)], 


where W is the average value of the interaction between different domains, given 
by 


b 
—_ se aff PDP 5 i +f Qe(2)O") 4 ay 


2 tae |x — y| 


(14.6.6) 
with 


-1 
fa = exp[-B vi. Hvd)| bs exp[-B (Wh. H ve) . 


Note that )°; & = 1. 


7 For the notion of locality, see the footnote on page 181. 

8 The symbols H in this equation et seq refer to different particle numbers, which should be clear 
from the context. 

° See the footnote on page 135. 
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Given the set of orthonormal functions 2? for a given ball b, we are at liberty 
to replace them by a new set which is obtained from the original set by rotation 
R» of all the particle coordinates around the center of the ball. For this new 
set, the expectation value of H stays the same, while all the Q2(x) change to 
QO°(R°x). The only effect of these replacements (one for each ball b) will be to 
replace Q?,(x) by Q°,(R°x) for all @ and all b in (14.6.6), while &; remains 
unchanged. 

The important point is that there is a choice of R? such that W < 0. This 
follows from the fact that the average of W over all rotations of all the balls 
equals zero. This is a consequence of Newton’s theorem, since all the terms in 
(14.6.6) vanish for radial Q?. Note that for this argument it is not necessary for 
Q” to be radial. 

We conclude that 


exp [—B|Q,|f;] = exp[-A(W”. HY”) | | » exp[—B(w2, H 2) . 
b a 


Since this inequality holds for any choice of the orthonormal functions 2, the 
variational principle in Lemma 14.1 implies that 


J 
exp [—A|Q;| fi] = exp cv Hw’)-B>- rll ; 


k=1 
Since Ay = ny|Qj—«|/|Qj| according to (14.6.1), this implies (14.6.2) with 


1 


= Y HW"). 
“Tape 


It is left to derive a uniform upper bound on c. The number of particles 
NY” in the complement I’; equals @|Q2;|(1 + p)Aj41. Moreover, according to 
the cheese theorem I; contains 4; = p/(1 + p)Vt) disjoint balls of radius 
(1 + p)~'ro. We place the particles in these balls, with an average number of 
oe(47/3)0 + pyr per ball. By the same argument as above, we can arrange the 
states such that the interaction between balls is negative. The energy (W”, H Ww) 
is then bounded above by a constant (depending on g) times N” which, for 
convenience, we write as a constant times N’(1+ p)* = [2]. + py Aj+1- 
This completes the proof. | 
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14.6.2 Proof for General Domains 


We will now extend the analysis in the previous subsection to an arbitrary 
regular sequence of domains {2;. We choose the particle numbers M; to be 
as close to Q|&2;|/Z as possible, i.e., |Mj; — @|Q;|/Z| < 1. Moreover, N; = 
M,Z because of neutrality. For this choice of particle numbers we shall show 
that f(B, 23, Nj, M;)/|Q;| converges to the same number as for the standard 
sequence of domains. 

Pick an € > 0, and choose J large enough such that for j > J, f;, the specific 
free energy of the standard sequence defined in the previous section, is within 
é of its limiting value as j > oo, which we denote by f (and whose existence 
was proved above). Let K > J. According to Corollary 14.1, item (1), we can 
pack mnx_,j balls of radius (1 + p)/ro into the domain 239 Sway hk =I, 
where m is the largest integer less than or equal to e(1 + ay * (iQ, —-VCi+ 
p)* (4 /3)'3./3; Q,;)]. 


The volume that is occupied by these balls equals 


= Pe. "eas i: (25) 
ar ae pti | 


Because of the assumption that Q2; is a Van Hove sequence, m/|Q ;| converges to 
o(1+ p)?** asj > oo. We thus conclude that lim j_,.. Vj/|Qj| = 1 — (p/U + 
p))*~/ for fixed K and J. 

Proceeding as in the previous subsection and using that f; < ftefors < 
j < K, we have 


F(B, 23, Nj, Mj) < ah +9) + 5M — @Vj) 
for some C > 0 (depending only on g and p). The last term is the contribution 
from the N; — @V; particles outside the balls. Note that N; — @V; equals @ 
times the volume of the domain plus or minus one particle. By putting these 
particles into appropriate smaller balls, as explained at the end of the previous 
subsection, their contribution to the free energy is easily seen to be proportional 
to the total particle number in this domain. 
We conclude that 


K-J K-J 
isin 2). Na) =F +0(1— (<2) )-¢0(") 
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Since K can be made arbitrarily large and ¢ is arbitrarily small, this proves that 


lim sup f(B, Q;, Nj, Mj) < f. 


For the lower bound, we proceed in a similar way, this time using item (2) of 
Corollary 14.1. For given j, we choose 8; to be the smallest ball containing Q; 
and whose radius equals (1 + p)"ro for some integer n. For fixed J < K chosen 
as before, such that | f; — f| <e for all j > J, B; \ 8; can be packed with 
nx; balls of radius (1 + p)/ro, with j = J,..., K —1, where yw equals 
the largest integer less than or equal to o(1 + p)**[ IB; \ Q3| -V(- + 
p)¥ (42¢/3)'/3./3; Q;) — V+ p)* (427/3)'7./3; B;)]. The total volume of 


these balls is now 


in ae ia 
V;=—(1 =| = . 
: a ac ( (4) 


We consider the free energy for the domain B; with e|8;| particles. Again by 
putting the particles appropriately into the balls, we conclude that 


IBF — ©) < IQ;|f(B, Q;, Nj, Mj) + Vif — £) + C (0|Bjl| — Nj — eVj). 


The last term comes from particles outside (2; and all the balls. It is important 
that these particles also be placed appropriately in balls in order to make sure 
that their Coulomb interaction with particles in the other domains vanishes. 

The regularity of the sequence Qj; implies that (|B;| — Vj)/|Q;| > 1 — 
(p/(1 + p))k~/ as j — oo. Moreover, (Bj| + Vj)/|2;| < C’ for some con- 
stant C’ independent of 7. Hence 


K-J K-J 
e ‘ fF = a : P 
linint (6.0, Ne MOS Fl a 
imint f(B, 2j,.Nj. Mj) i( (44) : e( Fi) 


Since K and ¢ are arbitrary, this concludes the proof of the fact that 


fin FCN, MHF 


for any regular sequence of domains Q2; and appropriately chosen particle num- 
bers satisfying |Mj; — @|2j;|/Z| < 1. 


270 Thermodynamic Limit for Coulomb Systems 


14.6.3 Convexity 


For each fixed £ and g, the specific free energy has a thermodynamic limit, 
which does not depend on the chosen sequence of domains. We denote it by 
f(B, @). In this subsection, we shall show that f is a convex function of @ and 
a concave function of 1/6. 

We first show convexity in 9. Fix0 < 4 < 1. Wechoosea sequence of domains 
Q2;, such that every (2; consists of two disjoint balls Qi and Q5 of volume V} 
and V; , respectively, with the property that 


1 
j . 

——,; =) forall j. 

Vee ; 

This sequence of domains satisfies the regularity condition of Definition 14.1. 

Choose M} and M7 such that |Mj — a1V}/Z| < 1/2 and |Mj — @2V?/Z| < 

1/2. Then 


1 1 
ahaa =(1=Doo| <1. 
v 7 oi — ( )o2| < 


By locating N; ! particles in the ball Qi and N; > particles in the ball O%, we 
conclude (using the variational principle anal N ewton’ s theorem) that 


f(B, 25, Nj + Nj, Mj + Mj) <Af(B, Q5,.Nj, Mj) 
+(1—A)f(B, 2),.N;,Mj). (14.6.7) 


All the three quantities converge in the limit j —> oo to f with the respective 
densities. That is, in the limit 7 — oo (14.6.7) becomes 


F(B, 401 +1 — Aor) S AF(B, 01) +1 — A) FCB, 02). 


This proves convexity of f in g. In particular, f is a continuous function of @, 
since all bounded convex functions are continuous. 

The concavity of f in 1/6 follows easily from the variational principle, 
Lemma 1|4.1. Since In}; exp[—6(¢;, H¢;)] is convex in B, In Z is also convex 
in 8. Hence —B~' In Z is concave in 1/, for every given domain and particle 
number. The pointwise limit of concave functions is concave. 


14.7 The Jellium Model 271 


14.6.4 General Sequences of Particle Numbers 


In order to conclude the proof of Theorem 14.2, we still have to show that 
f(B, 2;, Nj, M;) converges to f(£, o) for arbitrary sequences of particle num- 
bers satisfying limj—,.. N;/|Qj;| = @. Our main tool will be the monotonicity 
of the free energy in the size of the domain. 

For a given Q and t > 0, let tQ denote the scaled domain. The func- 
tion tr F(B,tQ, N, M) is clearly monotone decreasing in t, by the varia- 
tional principle, Lemma 14.1. In particular, since 9; := Nj;/|Qj;| < e+e) 
for e > 0 and large enough j, we have F(B, 8;, N;,Mj;) < F(B,loj;/(ed + 
€))]Q2;, Nj, M;). (Note that N;/[o;/(eU + €))|Q;|] = eC + e), independently 
of j.) The latter quantity, divided by |{2;|, converges to (1 + e) | f(B, od + 8)) 
as j > OO. 

In particular, 


1 1 . 
lim sup —— F(f, Q;, N;, M;) < —— ,o(1+6)). 
msup SFB. 2j..Nj Mi) = FB. aU +) 


In the same way one shows that 


1 1 . 

lim inf —— F(B, 2), Nj, Mj) = —— f(B, eC — €)). 
jroo |Q5| : l-e 

Since ¢ is arbitrary and f is continuous in Q, this proves the claim. Recall that 

the continuity of f follows from its convexity, which was shown in the previous 

subsection. a 


14.7 The Jellium Model 


The jellium model was invented by Wigner [187] as a simplified model of 
a solid. In this model there is only one kind of particle and they move in a 
fixed, uniform background of electric charge of the opposite sign. In Wigner’s 
interpretation of the model the particles were nuclei and the background was 
formed by the sea of electrons in a solid. In the absence of quantum mechanics 
and the uncertainty principle the nuclei will locate themselves at positions that 
minimize the total energy. Presumably they will form a regular lattice, but this 
has never been proved. In the quantum case the particles can be expected to 
form a lattice only when the density is small, so that the kinetic energy, [ |Vw le 
will not get in the way of localization. A phase transition is supposed to occur 


272 Thermodynamic Limit for Coulomb Systems 


from a smeared out state at high density to a localized lattice state at low 
density. 

Here, we shall continue with our previous convention and call the moveable 
particles electrons. The background will have uniform positive charge density 9 
in some domain 2. The number of electrons is N = @ |&2| for charge neutrality, 
and they are confined to the domain (2. The Hamiltonian is 


N 
H = Yo (5p? awa) +01) +0. (14.7.1) 


i=1 


1 
UG Nees y 


MD= D) | aT 


1<i<j<N 


eR ire sera) We (14.7.2) 


This Hamiltonian is appropriate both in the classical and the quantum cases. In 
the latter p* is to be interpreted as —A with Dirichlet boundary conditions on 
Q. 

Note that -AW(x) = 470 forx € Q while -AW(x) = Oforx ¢ Q. If Qis 
a ball of radius R we can compute W. Namely, W(x) = 1 0(6R* — 2|x|*)/3 for 
|x| < R, while W(x) = 4s R°0/(3|x|) for |x| > R. 

Even if we omit the kinetic energy term there is stability of the second 
kind. This can be seen by replacing each charge —./a at x; by the same charge 
uniformly distributed in a small ball of radius r centered at x;. (Compare with the 


where 


discussion of the Onsager hard-core gas in Section 14.4.) By Newton’s theorem 
the interaction of the small balls is smaller than |x; — or ae The interaction 
of the ball with the background is a little less negative than it is for a point, 
but only a little. The reader can easily work out that the optimum value of r is 
r = (3/47@)'/? and the energy is bounded from below by —0.9 aN (47r0/3)!3. 
(For details of the calculation see [124].) 
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The existence of the thermodynamic limit of the free energy, in analogy with 
our Theorem 14.2, was proved by Lieb and Narnhofer [124]. We shall content 
ourselves with just outlining the essential new feature that has to be considered 
for this system. This concerns the distribution of the particles into the subdomain 
of a big ball that is not covered by smaller balls. 

Before we do so, let us point out that this system is not thermodynamically 
stable in the following sense. Theorem 14.2 states that the limiting free energy 
is convex in @ and concave in 1/6 and we explained the physical significance of 
these properties. While the concavity in 1/6 will continue to hold in the jellium 
model, the convexity in @ will not hold, generally. The reason for lack of o 
convexity is that the background is held fixed like the membrane of a drum. If it 
were allowed to adjust itself by changing its size, and thereby its charge density, 
in order to minimize f, then convexity would be restored. 

Now we turn to the proof of the existence of the thermodynamic limit of the 
specific free energy, the main point of which is to prove the limit, at fixed g, 
for the standard increasing sequence of balls. For the standard sequence, we 
shall give the complete proof in the classical case and remark on the necessary 
modification in the quantum case at the end of this section. 

Let us first discuss the classical case. We take a ball B; and fill it with balls 
B;-1, ..., Boas before. There will be a certain part of B; that is not covered by 
the smaller balls and we call this region D;. In each subdomain we put exactly 
the right number of electrons so that the density is the given 9, namely the 
number of electrons equals 9 times the volume of the subdomain. This means 
that in doing the configurational integral for the partition function, we integrate 
only over that subset of Q% where the number of particles in each subdomain 
is precisely g times the volume of the ball. This obviously gives a lower bound 
on Q. The number of ways we can distribute the particles in the subdomains 
is the multinomial coefficient N! divided by the product of the factorials of the 
individual particle numbers in the subdomains. 

The peculiar region Dj; also contains electrons at density @. We have to 
construct the filling this way, for if we had tried to put all the electrons into the 
small balls, with none in Dj, the electrostatic energy of such a configuration 
would be significantly too high, even for large j where the fraction of the volume 
covered by the balls is very close to 1. In the proof of Theorem 14.2, we confined 
these particles to smaller balls of size B_,. In the jellium case we can not do 
that; the resulting Coulomb energy in D; would be too big since the charge 
distribution would then be far from uniform. 
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If we could eliminate the Coulomb interaction among the different subdomains 
we would obtain the following upper bound on the configurational free energy: 


j 
ia < » ey +p dj OF (14.7.3) 
kel 


where f; cont is the configurational specific free energy for the domain B; a 
and $j; is the configurational specific free energy of D;. Note that pA; = 
1 — S°/_) Ax = |Dj;|/|B;|. The factorials of the particle Humber coming fii 
the permutations of the particles exactly compensate the denominators appearing 
in the definition (14.4.3) of the configurational partition function. 

We claim that Eq. (14.7.3) is valid even in the presence of the Coulomb 
interactions between the different subdomains. The integral over configurations 
of particles within each ball contains an average over rotations of any given 
configuration. By Jensen’s inequality and convexity of the exponential function 
(see the footnote on page 135) the average of the exponential function can be 
bounded from below by the exponential of the average. Since all the subdomains 
except for one (namely, D;) are neutral balls, the average over rotations of the 
interaction energy between different subdomains is zero by Newton’s theorem. 
In this way, we arrive at the fundamental inequality (14.7.3). We note that the 
argument given here is the classical version of the argument in the quantum case 
given in Lemma 14.2. 

It remains to find an upper bound to ¢;. We shall show that (for large 7) 


oj) < gine =i); (14.7.4) 


This follows again from Jensen’s inequality, this time applying it to the integra- 
tion over all configurations of the electrons in D;. The average Coulomb energy 
within D; is negative when all the electrons are smeared out uniformly over D; 
(namely the number of electrons in D; times the negative of the self-energy of a 
single electron smeared out over D;), and hence the integral of the exponential 
factor is bounded from above by | D;|° IDjl / (@|D;\)!. An application of Stirling’s 
formula yields (14.7.4). 

By our previous analysis of inequalities of the type (14.7.3), we know that a 
limit exists for fj cont as j —> oo. This finishes the proof in the classical case for 
the standard sequence. The proof for general sequences of domains and particle 
numbers parallels that in Section 14.6. For details we refer the reader to [124]. 
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The proof in the quantum case has the same structure. The only new difficulty 
comes from estimating @;, the specific free energy of D;. We can not just 
smear the electrons over the domain Dj, as we did in the classical case, for 
two reasons. One is the Dirichlet boundary conditions on D; which prevents 
the particles from getting too close to the boundary of Dj, and the other is 
the antisymmetry requirement on the allowed wave functions, which also raises 
the kinetic energy. These are technical rather than conceptual problems and are 
overcome in [124, Sect. 5.1]. 
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